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(57) Abstract: This invention provides methods for the stable introduction of heterologous coding sequences into the genome of a 

£5 bird and expressing the coding sequences to produce desired proteins or to alter the phenotype of the bird. The present invention 
provides preferred methods for introducing a transgene into the cytoplasm of avian embryonic cells by cytoplasmic microinjection. 
^ The embryo then develops into a transgenic adult capable of expressing a heterologous protein and/or capable of generating a line 
^ of transgenic birds through breeding. Synthetic vectors and gene promoters useful in the methods are also provided by the present 
^ invention, as are transgenic birds that express heterologous protein and avian eggs containing heterologous protein. 
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PRODUCTION OF A TRANSGENIC AVIAN 
BY CYTOPLASMIC INJECTION 

5 

This application claims the benefit of United States Provisional Application No. 
60/322,969, filed September 18, 2001, and United States Provisional Application No. 
60/351,550, filed January 25, 2002, both of which are incorporated by reference herein in 
their entireties. 

10 

1. FIELD OF THE INVENTION 

The present invention relates to methods of producing a transgenic avian by 
introducing a nucleic acid encoding a heterologous protein into an avian embryo preferably 
by cytoplasmic injection, but also by other methods of introducing nucleic acids into 

1 5 embryonic cells, including but not limited to, nuclear transfer, retroviral vector infection, 
and fertilization with sperm containing the nucleic acid. The present invention further 
relates to a transgenic avian expressing a heterologous polypeptide, which, preferably, is 
deposited into the white of the avian egg. The invention further provides vectors containing 
coding sequences for heterologous proteins, the expression of which is under the control of 

20 a promoter and other regulatory elements that cause expression of the heterologous protein 
and preferably, lead to deposition of the protein in the avian egg. Also included in the 
invention are avian eggs derived from the transgenic avians and the heterologous proteins 
isolated therefrom. 

25 2. BACKGROUND 

The field of transgenics was initially developed to understand the action of a single 
gene in the context of the whole animal and the phenomena of gene activation, expression, 
and interaction. This technology has also been used to produce models for various diseases 
in humans and other animals and is amongst the most powerful tools available for the study 

30 of genetics, and the understanding of genetic mechanisms and function. From an economic 
perspective, the use of transgenic technology for the production of specific proteins or other 
substances of pharmaceutical interest (Gordon et ai t 1987, Biotechnology 5: 1 183-1 187; 
Wilmut et al, 1990, Theriogenology 33: 113-123) offers significant advantages over more 
conventional methods of protein production by gene expression. 

35 
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Heterologous nucleic acids have been engineered so that an expressed protein may 
be joined to a protein or peptide that will allow secretion of the transgenic expression 
product into milk or urine, from which the protein may then be recovered. These 
procedures have had limited success and may require lactating animals, with the attendant 
5 costs of maintaining individual animals or herds of large species, including cows, sheep, or 
goats. 

The hen oviduct offers outstanding potential as a protein bioreactor because of the 
high levels of protein production, the promise of proper folding and post-translation 
modification of the target protein, the ease of product recovery, .and the shorter 

10 developmental period of chickens compared to other potential animal species. The 
production of an avian egg begins with formation of a large yolk in the ovary of the hen. 
The unfertilized oocyte or ovum is positioned on top of the yolk sac. After ovulation, the 
ovum passes into the infundibulum of the oviduct where it is fertilized, if sperm are present, 
and then moves into the magnum of the oviduct, lined with tubular gland cells. These cells 

15 secrete the egg-white proteins, including ovalbumin, lysozyme, ovomucoid, conalbumin and 
ovomucin, into the lumen of the magnum where they are deposited onto the avian embryo 
and yolk. 

2.1 Microinjection 

20 Historically, transgenic animals have been produced almost exclusively by 

microinjection of the fertilized egg. Mammalian pronuclei from fertilized eggs are 
microinjected in vitro with foreign, i.e., xenogeneic or allogeneic, heterologous DNA or 
hybrid DNA molecules. The microinjected fertilized eggs are then transferred to the genital 
tract of a pseudopregnant female (e.g., Krimpenfort et aL, in U.S. Pat No. 5,175,384). 

25 However, the production of a transgenic avian using microinjection techniques is more 
difficult than the production of a transgenic mammal. In avians, the opaque yolk is 
positioned such that visualization of the pronucleus, or nucleus of a single-cell embryo, is 
impaired thus preventing efficient injection of the these structures with heterologous DNA- 
What is therefore needed is an efficient method of introducing a heterologous nucleic acid 

30 into a recipient avian embryonic cell. 

Cytoplasmic DNA injection has previously been described for introduction of DNA 
directly into the germinal disk of a chick embryo by Sang and Perry, 1989, Mol Reprod 
Dev. 1 : 98-106, Love et aL, 1994, Biotechnology 12: 60-3, and Naito et aL, 1994, MoL 
Reprod. Dev. 37: 167-171 ; incorporated herein by reference in their entireties. Sang and 

35 perry described only episomal replication of the injected cloned DNA, while Love et al. 
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suggested that the injected DNA becomes integrated into the cell's genome and Naito et a!. 
showed no direct evidence of integration. In all these cases, the germinal disk was not 
visualized during microinjection, i.e., the DNA was injected "blind" into the geiminal disk. 
Such prior efforts resulted in poor and unstable transgene integration. None of these 
methods were reported to result in expression of the transgene in eggs and the level of 
mosaicism in the one transgenic chicken reported to be obtained was one copy per 10 
genome equivalents. 

2.2 Retroviral Vectors 

Other techniques have been used in efforts to create transgenic chickens expressing 
heterologous proteins in the oviduct. Previously, this has been attempted by microinjection 
of replication defective retroviral vectors near the blastoderm (PCT Publication WO 
97/47739, entitled Vectors and Methods for Tissue Specific Synthesis of Protein in Eggs of 
Transgenic Hens, by MacArthur). Bosselman et al. in U.S. Patent No. 5,162,215 also 
* describes a method for introducing a replication-defective retroviral vector into a pluripotent 
stem cell of an unincubated chick embryo, and further describes chimeric chickens whose 
cells express a heterologous vector nucleic acid sequence. However, the percentage of G, 
transgenic offspring (progeny from vector-positive male G 0 birds) was low and varied 
between 1 % and approximately 8%. Such retroviral vectors have other significant 
20 limitations, for example, only relatively small fragments of nucleic acid can be inserted into 
the vectors precluding, in most instances, the use of large portions of the regulatory regions 
and/or introns of a genomic locus which, as described herein, can be useful in obtaining 
significant levels of heterologous protein expression. Additionally, retroviral vectors are 
generally not appropriate for generating transgenics for the production of pharmaceuticals 
25 due to safety and regulatory issues. 



23 Transfection of Male Germ Cells, Followed by Transfer to Recipient Testis 
Other methods include in vitro stable transfection of male germ cells, followed by 
transfer to a recipient testis. PCT Publication WO 87/05325 discloses a method of 
30 transferring organic and/or inorganic material into sperm or egg cells by using liposomes. 
Bachillerera/. (1991, Mol. Reprod. Develop. 30: 194-200) used Upofectin-based liposomes 
to transfer DNA into mice sperm, and provided evidence that the liposome transfected DNA 
was overwhelmingly contained within the sperm's nucleus although no transgenic mice 
could be produced by this technique. Nakanishi & fritani (1993, Mol. Reprod. Develop. 36: 
35 258-261) used Lipofectin-based liposomes to associate heterologous DNA with chicken 
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sperm, which were in turn used to artificially inseminate hens. There was no evidence of 
genomic integration of the heterologous DNA either in the DNA-liposome treated sperm or 

in the resultant chicks. 

Several methods exist for transferring DNA into sperm cells. For example, 
5 heterologous DNA may also be transferred into sperm cells by electroporation that creates 
temporary, short-lived pores in the cell membrane of living cells by exposing them to a 
sequence of brief electrical pulses of high field strength. The pores allow cells to take up 
heterologous material such as DNA, while only slightly compromising cell viability. Gagne 
et al. (1991, Mol. Reprod. Dev. 29: 6-15) disclosed the use of electroporation to introduce 
10 heterologous DNA into bovine sperm subsequently used to fertilize ova. However, there 
was no evidence of integration of the electroporated DNA either in the sperm nucleus or in 
the nucleus of the egg subsequent to fertilization by the sperm. 

Another method for transferring DNA into sperm cells was initially developed for 
integrating heterologous DNA into yeasts and slime molds, and later adapted to sperm, is 
15 restriction enzyme mediated integration (REND) (Shemesh et al, PCT International 

Publication WO 99/42569). REMI utilizes a linear DNA derived from a plasmid DNA by 
cutting that plasmid with a restriction enzyme that generates single-stranded cohesive ends. 
The linear, cohesive-ended DNA together with the restriction enzyme used to produce the 
cohesive ends is then introduced into the target cells by electroporauon or liposome 
20 transfection. The restriction enzyme is then thought to cut the genomic DNA at sites that 
enable the heterologous DNA to integrate via its matching cohesive ends (Schiestl and 
Petes, 1991, Proc. Natl Acad. Set. USA 88: 7585-7589). 

It is advantageous, before the implantation of the transgenic germ cells into a testis 
of a recipient male, to depopulate tbe testis of untransfected male germ cells. Depopulation 
25 ofthe testis has commonly been by exposing the whole animal to gamma irradiation by 
localized irradiation of me testis. Gamma radiation-induced spermatogonia! degeneration is 
probably related to the process of apoptosis. (Hasegawa et al:,l 998, Badiat. Res. 149:263- 
70). Alternatively, a composition containing an alkylating agent such as busulfan 
(MYLERAN™) can be used, as disclosed in Jiang FX, 1998, Anat. Embryol. 198(1):53- 
30 61; Russell and Brinster. 1996, J. Androl. 17(6):615-27; Boujrad et al, Andrologia 27(4), 
223-28 (1995); Linder et al, 1992, Reprod. Toxicol 6(6):491-505; Kasuga and Takahashi, 
1986, Endocrinol Jpn 33(1):105-15. These methods likewise have not resulted in efficient 
transgenesis or heterologous protein production in avian eggs. 
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2.5 Nuclear Transfer 

Nuclear transfer from cultured cell populations provides an alternative method of 
genetic modification, whereby donor cells may be sexed, optionally genetically modified, 
and then selected in culture before their use. The resultant transgenic animal originates 

5 from a single transgenic nucleus and mosaics are avoided. The genetic modification is 
easily transmitted to the offspring. Nuclear transfer from cultured somatic cells also 
provides a route for directed genetic manipulation of animal species, including the addition 
or "knock-in" of genes, and the removal or inactivation or "knock-out" of genes or their 
associated control sequences (Polejaeva et aL, 2000, Tlieriogenology, 53: 1 17-26). Gene 

10 targeting techniques also promise the generation of transgenic animals in which specific 
genes coding for endogenous proteins have been replaced by exogenous genes such as those 

coding for human proteins. 

The nuclei of donor cells are transferred to oocytes or zygotes and, once activated, 
result in a reconstructed embryo. After enucleation and introduction of donor genetic 

15 material, the reconstructed embryo is cultured to the morula or blastocyte stage, and 

transferred to a recipient animal, either in vitro or in vivo (Eyestone and Campbell, 1999, J 
Reprod Fertil Sappl. 54:489-97). Double nuclear transfer has also been reported in which 
an activated, previously transferred nucleus is removed from the host unfertilized egg and 
transferred again into an enucleated fertilized embryo. 

20 The embryos are then transplanted into surrogate mothers and develop to term. In 

some mammalian species (mice, cattle and sheep) the reconstructed embryos can be grown 
in culture to the blastocyst stage before transfer to a recipient female. The total number of 
ofispring produced from a single embryo, however, is limited by the number of available 
blastomeres (embryos at the 32-64 cell stage are the most widely used) and the efficiency of 

25 the nuclear transfer procedure. Cultured cells can also be frozen and stored indefinitely for 
future use. 

Two types of recipient cells are commonly used in nuclear transfer procedures: 
oocytes arrested at the metaphase of the second meiotic division (ME) and which have a 
metaphase plate with the chromosomes arranged on the meiotic spindle, and pronuclear 
30 zygotes. Enucleated two-cell stage blastomeres of mice have also been used as recipients. 
In agricultural mammals, however, development does not always occur when pronuclear 
zygotes are used, and, therefore, Mil-arrested oocytes are the preferred recipient cells. 

Although gene targeting techniques combined with nuclear transfer hold tremendous 
promise for nutritional and medical applications, current approaches suffer from several 
35 limitations, including long generation times between the founder animal and production 



WO 03/025146 PCT/US02/29878 

* ■ 

* » 

A w^tArinarv r.nsts. It is therefore desirable to 
transgenic iierua, aim crAituotvo uuouu^vu; ^» • ~ — — j 

use a system where cultured somatic cells for nuclear transfer are more efficiently 
employed. 

What is needed, therefore, is an efficient method of generating transgenic avians that 
5 express a heterologous protein encoded by a transgene, particularly in the oviduct for 
deposition into egg whites. 

3. SUMMARY OF THE INVENTION 

This invention provides methods for the stable introduction of heterologous coding 

10 sequences into the genome of a bird and expressing those heterologous coding sequences to 
produce desired proteins. Synthetic vectors and gene promoters useful in the methods are 
also provided by the present invention, as are transgenic birds that express a heterologous 
protein and avian eggs containing a heterologous protein. In a preferred embodiment, the 
vectors useful in methods of the invention are not eukaryotic viral, more preferably not 

15 retroviral, vectors (although the vectors may contain transcriptional regulatory elements, 
such as promoters, from eukaryotic viruses). In other embodiments, however, the vectors 
are eukaryotic viral vectors or are retroviral vectors. 

One aspect of the present invention is a method of producing a transgenic avian 
capable of expressing a heterologous protein. The method comprises isolating an early 

20 stage embryo from a fertilized hen, and microinjecting into the isolated embryo a selected 
nucleic acid that encodes the desired heterologous protein. The microinjected avian embryo 
is transferred to the oviduct of a recipient hen for in vivo development and to be laid as a 
shelled egg (or, alternatively, cultured ex vivo). The shelled egg is incubated to hatch a 
transgenic chick that has incorporated, preferably, integrated into its genome, the selected 

25 nucleic acid 

The present invention provides methods for introducing a transgene into the 
cytoplasm of avian embryonic cells by cytoplasmic microinjectiorL The cells may be 
embryonic cells as, for example, from a single cell embryo visualized through overlying 
yolk or tissue by using, for example, ligjit microscopy, or a camera system such as a CCD 

30 camera with a microscopic lens {e.g., as disclosed in PCT International Publication WO 
02/064727 by Christmann, which is incorporated by reference herein in its entirety). 
Microeiectroporation can optionally be used to enhance the uptake of exogenous DNA into 
the cell nucleus and improving the efficiency of DNA integration. The cytoplasmically 
microinjected embryo is then, preferably, returned to a female bird to be laid as a hard-shell 

35 egg or, as an alternative, cultured ex vivo. After hatching from the hard-shelled egg, a 
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transgenic chick is nroduced that expresses a heterologous protein and/or that can be bred to 
generate a line of transgenic birds expressing a heterologous protein. 

In alternative embodiments, the nucleic acid is introduced by infection or injection 
of the nucleic acid contained within a retroviral vector, sperm-mediated transgenesis, or 

5 nuclear transfer. 

In one embodiment, the present invention provides methods for producing 
heterologous proteins in avians. Transgenes are introduced by, most preferably, 
cytoplasmic microinjection into one embryonic cell, preferably the germinal disk of an early 
stage embryo, that then develop into a transgenic bird. The protein of interest may be 

10 expressed in the tubular gland cells of the magnum of the oviduct, secreted into the lumen, 
or deposited within the egg white onto the egg yolk or expressed, for example, in the serum 
of the bird. Such transgenic birds can also be bred to identify birds that carry the transgene 
in their germ line. The exogenous genes can therefore be transmitted to birds by both 
cytoplasmic microinjection of the exogenous gene into bird embryonic cells, and by 

1 5 subsequent stable transmission of the exogenous gene to the bird' s offspring in a Mendelian 
fashion. 

The present invention provides for a method of producing a heterologous protein in 
an avian oviduct. The method comprises, as a first step, providing a vector containing a 
coding sequence and a promoter that functions in avians, preferably in the avian magnum, ^ 

20 operably linked to the coding sequence, so that the promoter can effect expression of the 
nucleic acid in the tubular gland cells of the magnum of an avian oviduct and/or in any other 
desired tissue of the avian. In a preferred embodiment, the vector containing the transgene 
is not a eukaryotic viral vector (preferably, not a retroviral vector, such as but not limited to 
reticuloendotheliosis virus (REV), ALV or MuLV) or derived from a eukaryotic virus (but, 

25 in certain embodiments, may contain promoter and/or other gene expression regulatory 
sequences from a eukaryotic virus, such as, but not limited to, a Rous sarcoma virus viral 
promoter or a cytomegalovirus promoter). Next, the vector is introduced into avian 
embryonic cells by cytoplasmic microinjection so that the vector sequence may be randomly 
inserted into the avian genome. Finally, a mature transgenic avian that expresses the 

30 exogenous protein in its oviduct is derived from the transgenic embryonic cells or by 
breeding a transgenic avian derived from the transgenic embryonic cells. 

in particular embodiments, the level of mosaicism of the transgene (percentage of 
cells containing the transgene) in avians hatched from microinjected embryos (i.e. 9 the GqS) 
is greater than 5%, 10%, 25%, 50%, 75% or 90%, or is the equivalent of one copy per one 

35 genome, two genomes, five genomes, seven genomes or eight genomes, as determined by 
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embodiments, the percentage of G 0 s that transmit the transgene to progeny (Gjs) is greater 
than 5%, preferably, greater than 10%, 20%, 30%, 40%, and, most preferably, greater than 
50%. In other embodiments, the efficiency of transgenesis (i.e., number of G 0 s containing 
5 the transgene) is greater than 2%, 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80% or 
99%. 

This method can also be used to produce an avian egg containing an exogenous 
protein when the exogenous protein, that is expressed for example, in the tubular gland cells 
or fibroblast cells, is also secreted into the oviduct lumen and deposited, e.g., into the white 

10 of an egg. In other embodiments of the invention, the exogenous protein is expressed in the 
liver, or secreted into the blood, and deposited into the yolk. In preferred embodiments, the 
level of expression of the heterologous protein in the egg white of eggs laid by G 0 and/or G, 
chicks and/or their progeny is greater than 5 jig, 10 ^g, 50 fxg, 100 fig, 250 \ig, 500 fig, or 
750 ^g, more preferably greater than 1 mg, 2 mg, 5 mg, 10 mg, 20 mg, 50 mg, 100 mg, 200 

15 mg, 500 mg, 700 mg, 1 gram, 2 grams, 3 grams, 4 grams or 5 grams. 

The present invention further provides promoters useful for expression of the 
heterologous protein in the egg. For example, the promoter comprises regions of at least 
two promoters derived from an avian including, but not limited to, an ovomucoid, 
ovalbumin, conalbumin, lysozyme, or ovotransferrin, or any other promoter that directs 

20 expression of a gene in an avian, particularly in a specific tissue of interest, such as the 
magnum. Alternatively, the promoter used in the expression vector may be derived from 
that of the lysozyme gene that is expressed in both the oviduct and macrophages. In other 
embodiments the promoter is a viral or non-avian promoter, e.g., cytomegalovirus or Rous 
sarcoma virus promoter. In certain embodiments, the promoter is constitutive in avian cells. 

25 in other embodiments, the promoter is inducible. In particular embodiments, the gene 
regulatory sequences are flanked by matrix attachment regions (MARs), preferably, but not 
limited to those associated with the lysozyme gene in chickens or other avia. The nucleic 
acid encoding the polypeptide may be operably linked to a transcription pro^ 
transcription terminator. In other embodiments, prior to microinjection, the vector is mixed 

30 with a nuclear localization signal peptide to facilitate targeting of the injected vector to the 
nucleus. 

Other embodiments of the invention provide for transgenic avians, such as chickens 
or quail, carrying a transgene in the genetic material of their germ-line tissue, preferably 
where the transgene was not introduced into the avian genome using a eukaryotic viral 
35 promoter. The transgene incorporated into the genomic DNA of a recipient bird can encode 
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u~ p ~*ia~ fv,nt movKp fnr p.yamnla but is not limited to, a cytokine, a growth 

factor, enzyme, structural protein, immunoglobulin, or any other polypeptide of interest that 
is capable of being expressed by an avian cell or tissue. Preferably, the heteologous protein 
is a mammalian, or preferably a human, protein or derived from a mammalian, or preferably 
5 a human, protein (e.g. , a derivative or variant thereof). In particular embodiments, the 
invention provides heterologous proteins isolated or purified from an avian tissue, 
preferably serum, more preferably eggs, most preferably egg whites, and pharmaceutical 
compositions comprising such heterologous proteins. In a more preferred embodiment, the 
heterologous protein is an antibody that is human (including antibodies produced from 
10 human immunoglobulin sequences in mice or in antibody libraries or synthetically produced 
but having variable domain framework regions that are the same as or homologous to 
human framework regions) or humanized. 

The present invention further relates to nucleic acid vectors (preferably, not derived 
from eukaryotic viruses, except, in certain embodiments, for eukaryotic viral promoters and/ 
1 5 or enhancers) and transgenes inserted therein that incorporate multiple polypeptide- 
encoding regions, wherein a first polypeptide-encoding region is operatively linked to a 
transcription promoter and a second polypeptide-encoding region is operatively linked to an 
Internal Ribosome Entry Sequence (IRES). For example, the vector may contain coding 
sequences for two different heterologous proteins (e.g., the heavy and light chains of an . 
20 immunoglobulin) or the coding sequences for all or a significant part of the genomic 
sequence for the gene from which the promoter driving expression of the transgene is 
derived, and the heterologous protein desired to be expressed (&£., a construct containing 
the genomic coding sequences, including introns, of the avian lysozyme gene when the 
avian lysozyme promoter is used to drive expression of the transgene, an IRES, and the 
25 coding sequence for the heterologous protein desired to be expressed downstream (Le., 3' on 
the RNA transcript of the IRES)). Thus, in certain embodiments, the nucleic acid encoding 
the heterologous protein is introduced into the 5' untranslated or 3' untranslated regions of 
an endogenous gene, such as but not limited to, lysozyme, ovalbumin, ovotransferrin, and 
ovomucoid, with an IRES sequence directing translation of the heterologous sequence. 
30 Such nucleic acid constructs, when inserted into the genome of a bird and expressed 

therein, will generate individual polypeptides that may be post-translationally modified, for 
example, glycosylated or, in certain embodiments, be present as complexes, such as 
heterodimers with each other in the white of the avian egg. Alternatively, the expressed 
polypeptides may be isolated from an avian egg and combined in vitro, or expressed in a 
35 non-reproductive tissue such as serum In other embodiments, for example, but not limited 



-9- 



WO 03/025146 PCT/US02/29878 

pvnrpccmn nf wv. h^w and li^ht chains of an antibody is desired, two separate 
constructs, each containing a coding sequence for one of the heterologous proteins operably 
linked to a promoter (either the same or different promoters), are introduced by 
microinjection into cytoplasm of one or more embryonic cells and transgenic avians 

5 harboring both transgenes in their genomes and expressing both heterologous proteins are 
identified. Alternatively, two transgenic avians each containing one of the two heterologous 
proteins {e.g., one transgenic avian having a transgene encoding the light chain of an 
antibody and a second transgenic avian having a transgene encoding the heavy chain of the 
antibody) can be bred to obtain an avian containing both transgenes in its germline and 

1 0 expressing both transgene encoded proteins, preferably in eggs. 

In other embodiments, the present invention further provides methods for the 
introduction to an avian genome of at least one transgene encoding at least one heterologous 
polypeptide including sperm-mediated transfer where nucleic acids are incorporated into 
avian sperm by liposomes, electroporation, restriction enzyme mediated integration (REMT), 

1 5 or similar methods. The modified sperm may then be returned to the testis of a male bird 
which then may be mated with a female to generate transgenic offspring, or the modified 
sperm may be used directly to fertilize the female bird by artificial insemination to generate 

transgenic offspring. 

The present invention further provides methods for incorporating a transgene into 
20 the nucleus of an avian cell cultured in vitro including by transfection, cytoplasmic 
microinjection or pronuclear microinjection. The transgenic cell nucleus may then be 
transferred to a fertilized enucleated cell. The enucleated cell may be an embryonic cell of a 
bird egg visualized through overlying yolk or tissue by using two photon laser scanning 
microscopy. 

25 For convenience, certain terms employed in the specification, examples, and 

appended claims are collected here. 

3.1 Definitions 

The term "avian" as used herein is intended to refer to any species, subspecies or 
30 race of organism of the taxonomic class ova, such as, but not limited to, such organisms as 
chicken, turkey, duck, goose, quail, pheasants, parrots, finches, hawks, crows and ratites 
including ostrich, emu and cassowary. The term includes the various known strains of 
Gallus gallus, or chickens, (for example, White Leghorn, Brown Leghorn, Barred-Rock, 
Sussex, New Hampshire, Rhode Island, Ausstralorp, Minorca, Amrox, California Gray, 

35 
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other poultry commonly bred. 

The term "embryonic cells" as used herein refers to cells that are typically single cell 
embryos, or the equivalent thereof, and is meant to encompass dividing embryos, such as 
5 two-cell, four-cell, or even later stages as described by Eyal-Giladi and Kochav (1976, Dev. 
Biol. 49:321-337) and ova 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 12, 14, 16, 18, or 20 hours after the 
preceding lay. The embryonic cells may be isolated freshly, maintained in culture, or reside 
within an embryo. Although the present invention is generally described in terms of 
microinjection of a single-cell embryo, it should be recognized that other cells from an early 
10 stage embryo are suitable for cytoplasmic injection in the methods of the present injection. 
For example, cells obtained from a stage later than a stage I embryo, up to and including a 
stage X embryo, i.e., stages H-X, may be useful in the present invention. Chick 
developmental stages are described in the following reference, Eyal-Giladi and Kochav, 
1976, Dev. Biol. 49(2):321-37, which is hereby incorporated by reference in its entirety. 
15 The term "nucleic acid" as used herein refers to any natural and synthetic linear and 

sequential arrays of nucleotides and nucleosides, for example cDNA, genomic DNA, 
mRNA, tRNA, oligonucleotides, oligonucleosides and derivatives thereof. Representative 
examples of the nucleic acids of the present invention include bacterial plasmid vectors 
including expression, cloning, cosmid and transformation vectors such as, but not limited 
20 to, pBR322, animal viral vectors such as, but not limited to, modified adenovirus, influenza 
virus, polio virus, pox virus, retrovirus, and the like, vectors derived from bacteriophage 
nucleic acid, e.g, , plasmids and cosmids, artificial chromosomes, such as but not limited to, 
Yeast Artificial Chromosomes (YACs) and Bacterial Artificial Chromosomes (BACs), and 
synthetic oligonucleotides like chemically synthesized DNA or RNA. The term "nucleic 
25 acid" further includes modified or derivatised nucleotides and nucleosides such as, but not 
limited to, halogenated nucleotides such as, but not only, 5-bromouracil, and derivatised 
nucleotides such as biotin-labeled nucleotides. 

As used herein the terms "polypeptide'' and "protein" refer to a polymer of amino 
acids of three or more amino acids in a serial array, linked through peptide bonds. The term 
30 "polypeptide" includes proteins, protein fragments, protein analogues, oligopeptides and the 
like. The term "polypeptides" contemplates polypeptides as defined above that are encoded 
by nucleic acids, produced through recombinant technology, isolated from an appropriate 
source such as a bird, or are synthesized. The term "polypeptides" further contemplates 
polypeptides as defined above that include chemically modified amino acids or amino acids 
35 covalently or noncovalently linked to labeling ligands. 
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The term "fragment" as used herein to refers to an at least 10 ; 20, 50, 75. 100 ? 150, 
200, 250, 300, 500, 1000, 2000 or 5000 nucleotide long portion of a nucleic acid {e.g., 
cDNA) that has been constructed'artificially (e.g., by chemical synthesis) or by cleaving a 
natural product into multiple pieces, using restriction endonucleases or mechanical shearing, 

5 or enzymatically, for example, by PCR or any other polymerizing technique known in the 
art, or expressed in a host cell by recombinant nucleic acid technology known to one of skill 
in the art. The tenn "fragment" as used herein may also refer to an at least 5, 10, 20, 30, 40, 
50, 75, 100, 150, 200, 250, 300, 400, 500, 1000, 2000, or 5000 amino acid portion of a 
polypeptide, which portion is cleaved from a naturally occurring polypeptide by proteolytic 

10 cleavage by at least one protease, or is a portion of the naturally occurring polypeptide 
synthesized by chemical methods or using recombinant DNA technology (e.g., expressed 
from a portion of the nucleotide sequence encoding the naturally occurring polypeptide) 

known to one of skill in the art. 

The term "isolated nucleic acid" as used herein refers to a nucleic acid that has been 

15 removed from other components of the cell containing the nucleic acid or from other 
components of chemical/synthetic reaction used to generate the nucleic acid. In specific 
embodiments, the nucleic acid is 50%, 60%, 70%, 80%, 90%, 95%, 99% or 100% pure. 
The techniques used to isolate and characterize the nucleic acids and proteins of the present 
invention are well known to those of skill in the art and standard molecular biology and 

20 biochemical manuals may be consulted to select suitable protocols without undue 
experimentation. See, for example, Sambrook et al, 2001, Molecular Cloning: A 
Laboratory Manual, 3rd ed., Cold Spring Harbor Press; the content of which is herein 
incorporated by reference in its entirety. 

By the use of the term "enriched" in reference to nucleic acid it is meant that the 

25 specific DNA or RNA sequence constitutes a significantly higher fraction of the total DNA 
or RNA present in the cells or solution of interest than in normal or diseased cells or in the 
cells from which the sequence was taken. Enriched does not imply that there are no other 
DNA or RNA sequences present, just that the relative amount of the sequence of interest 
has been significantly increased, for example, by 1 fold, 2 fold, 5 fold, 10 fold, 50 fold, 100 

30 fold, 500 fold, 1000 fold, 10,000 fold, 100,000 fold, or 1,000,000 fold. The other DNA 
may, for example, be derived from a yeast or bacterial genome, or a cloning vector, such as 

a plasmid or a viral vector. 

The terms "transcription regulatory sequences" and "gene expression control 
regions" as used herein refer to nucleotide sequences that are associated with a gene nucleic 
35 acid sequence and which regulate the transcriptional expression of the gene. Exemplary 
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steroid response elements, negative regulatory elements, and the like. The "transcription 
regulatory sequences" may be isolated and incorporated into a vector nucleic acid to enable 
regulated transcription in appropriate cells of portions of the vector DNA. The 
5 "transcription regulatory sequence" may precede, but is not limited to, the region of a 
nucleic acid sequence that is in the region 5' of the end of a protein coding sequence that 
may be transcribed into mRNA. Transcriptional regulatory sequences may also be located 
within a protein coding region, in regions of a gene that are identified as "intron" regions, or 
may be in regions of nucleic acid sequence that are in the region of nucleic acid. 
1 0 The term "promoter' as used herein refers to the DNA sequence that determines the 

site of transcription initiation by an RNA polymerase. A "promoter-proximal element" may 
be a regulatory sequence within about 200 base pairs of the transcription start site. A 
"magnum-specific" promoter, as used herein, is a promoter that is primarily or exclusively 
active in the tubular gland cells of the avian magnum. Useful promoters also include 
15 exogenously inducible promoters. These are promoters that can be "turned on" in response 
to an exogenously supplied agent or stimulus, which is generally not an endogenous 
metabolite or cytokine. Examples include an antibiotic-inducible promoter, such as a 
tetracycline-inducible promoter, a heat-inducible promoter, a light-inducible promoter, or a 
laser inducible promoter. {e.g., Halloran et al, 2000, Development 127: 1953 -1960; Gemer . 
20 et al, 2000, Int. J. Hyperthermia 16: 171-81; Rang and Will, 2000, Nucleic Acids Res. 28: 
1 120-5; Hagihara et al., 1999, Cell Transplant 8: 4314; Huang et al., 1999, Mol. Med. 5: 
129-37- Forster et al, 1999, Nucleic Acids Res. 27: 708-10; Liu et al, 1998, Biotechniques 
24: 624-8, 630-2; the contents of which have been incorporated herein by reference in their 
entireties). 

25 To fecilitate manipulation and handling of the nucleic acid to be administered, the 

nucleic acid is preferably inserted into a cassette where it is operably linked to a promoter. 
The promoter should be capable of driving expression in the desired cells. The selection of 
appropriate promoters can be readily accomplished. For some applications, a high 
expression promoter is preferred such as the cytomegalovirus (CMV) promoter. Other 

30 promoters useful in the present invention include the Rous Sarcoma Virus (RSV) promoter 
(Davis et al, 1993, Hum. Gene Therap. 4:151). In other embodiments, all or a portion of 
the, for example, lysozyme, ovomucoid, ovalbumin, albumin, conalbumin or ovotransferrin 
promoters, which direct expression of proteins present in egg white, are used, as detailed 
infra, or synthetic promoters such as the MDOT promoter described infra. 

35 
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The tenris "operably" or "operatively linked" refer to the configuration of the coding 
and control sequences so as to perform the desired function. Thus, control sequences 
operably linked to a coding sequence are capable of effecting the expression of the coding 
sequence and regulating in which tissues, at what developmental timepoints, or in response 
5 to which signals, etc., a gene is expressed. A coding sequence is operably linked to or under 
the control of transcriptional regulatory regions in a cell when DNA polymerase will bind 
the promoter sequence and transcribe the coding sequence into mRNA that can be translated 
into the encoded protein. The control sequences need not be contiguous with the coding 
sequence, so long as they function to direct the expression thereof. Thus, for example, 
10 intervening untranslated yet transcribed sequences, can be present between a promoter 
sequence and the coding sequence and the promoter sequence can still be considered 
"operably linked" to the coding sequence. Such intervening sequences include but are not 
limited to enhancer sequences which are not transcribed or are not bound by polymerase. 

The term "expressed" or "expression" as used herein refers to the transcription from 
15 a gene to give an RNA nucleic acid molecule complementary at least in part to a region of 
one of the two nucleic acid strands of the gene. The term "expressed" or "expression" as 
used herein also refers to the translation from said RNA nucleic acid molecule to give a 
protein or polypeptide or a portion thereof. 

The term "matrix attachment region" or "MAR" as used herein refers to a DNA 
20 sequence having an affinity or intrinsic binding ability for the nuclear scaffold or matrix. 
The MAR elements of the chicken lysozyme locus are described by Phi-Van et ah, 1996, 
EM.B.O. J. 76:665-664 and Phi-Van, L. and Stratling, WM., 1996, Biochem. 35:10735- 
10742; incorporated herein by reference in their entireties. 

The term "probe" as used herein, when referring to a nucleic acid, refers to a 
25 nucleotide sequence that can be used to hybridize with and thereby identify the presence of 
a complementary sequence, or a complementary sequence differing from the probe sequence 
but not to a degree that prevents hybridization under the hybridization stringency conditions 
used. The probe may be modified with labels such as, but not only, radioactive groups, 
biotin, and the like. 

30 The term "nucleic acid vector" as used herein refers to a natural or synthetic single 

or double stranded plasmid or viral nucleic acid molecule, or any other nucleic acid 
molecule, such as but not limited to YACs, BACs, bacteriophage-derived artificial 
chromosome (BBPAG), cosmid or PI derived artificial chromosome (PAQ, that can be 
transfected or transformed into cells and replicate independently of, or within, the host cell 

35 genome. A circular double stranded vector can be linearized by treatment with an 
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acid can be inserted into a vector by cutting the vector with restriction enzymes and ligating 
the pieces together. The nucleic acid molecule can be RNA or DNA 

The term "expression vector" as used herein refers to a nucleic acid vector that 
5 comprises regulatory sequences operably linked to a nucleotide sequence coding at least one 
polypeptide. As used herein, the term "regulatory sequences" includes promoters, 
enhancers, and other elements that may control gene expression. 

The term "recombinant cell" refers to a cell that has a new combination of nucleic 
acid segments that are not covalently linked to each other in nature in that particular 
10 configuration. Anew configuration of nucleic acid segments can be introduced into an 
organism using a wide array of nucleic acid manipulation techniques available to those 
skilled in the art. A recombinant cell can be a single eukaryotic cell, such as a mammalian 
cell, or a single prokaryotic cell. The recombinant ceil may harbor a vector that is 
extragenomic. An extragenomic nucleic acid vector does not insert into the cell's genome. 
15 A recombinant cell may further harbor a vector or a portion thereof (e.g., the portion 
containing the regulatory sequences and the coding sequence) that is intragenomic. The 
term intragenomic defines a nucleic acid construct incorporated within the recombinant 
cell's genome. 

The terms "recombinant nucleic acid" and "recombinant DNA" as used herein refer v 
20 a combination of at least two nucleic acids that is not naturally found in a eukaryotic or 
prokaryotic cell in that particular configuration. The nucleic acids may include, but are not 
limited to, nucleic acid vectors, gene expression regulatory elements, origins of replication, 
suitable gene sequences that when expressed confer antibiotic resistance, protein-encoding 
sequences and the like. The term "recombinant polypeptide" is meant to include a 
25 polypeptide produced by recombinant DNA techniques such that it is distinct from a 
naturally occurring polypeptide either in its location, purity or structure. Generally, such a 
recombinant polypeptide will be present in a cell in an amount different from that normally 
observed in nature. 

As used herein, the term "transgene" means a nucleic acid sequence (encoding, for 
30 example, a human interferon polypeptide) that is partly or entirely heterologous, i.e., 

foreign, to the transgenic animal or cell into which it is introduced, or, is homologous to an 
endogenous gene of the transgenic animal or cell into which it is introduced, but which is 
designed to be inserted, or is inserted, into the animal's genome in such a way as to alter the 
genome of the cell into which it is inserted (e.g., it is inserted at a location that differs from 
35 that of the natural gene or its insertion results in a knockout). A trangene also includes a 
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expression of an endogenous coding sequence, e.g., to increase expression and or to change 
the timing and or tissue specificity of expression, etc. {e.g., to effect "gene activation"). 
As used herein, a "transgenic avian" is any avian species, including the chicken, in 
5 which one or more of the cells of the avian may contain heterologous nucleic acid 

introduced by way of human intervention, such as by transgenic techniques known in the 
art, and particularly, as describedherein. The nucleic acid is introduced into a cell, directly 
or indirectly by introduction into a precursor of the cell, by way of deliberate genetic 
manipulation, such as by microinjection or by infection with a recombinant virus. The term 
10 genetic manipulation does not include classical cross-breeding, or in vitro fertilization 
(although it does include fertilization with sperm into which a transgene has been 
introduced, but rather is directed to the introduction of a recombinant DNA molecule. This 
molecule may be integrated within a chromosome, or it may be extrachromosomally 
replicating DNA. In the typical transgenic avian, the transgene causes cells to express a 
15 recombinant form of the subject polypeptide, e.g. either agonistic or antagonistic forms, or a 
form in which the gene has been disrupted. The terms "chimeric avian" or "mosaic avian" 
are used herein to refer to avians in which the recombinant gene is found, or in which the 
recombinant is expressed in some but not all cells of the avian. The term "tissue-specific 
chimeric avian" indicates that the recombinant gene is present and/or expressed in some 

20 tissues but not others. 

The term "chromosomal positional effect (CPE)" as used herein refers to the 
variation in the degree of gene transcription as a function of the location of the transcribed 
locus within the cell genome. Random transgenesis may result in a transgene being inserted 
at different locations in the genome so that individual cells of a population of transgenic 
25 cells may each have at least one transgene, each at a different location and therefore each in 
a different genetic environment. Each cell, therefore, may express the transgene at a level 
specific for that particular cell and dependant upon the immediate genetic environment of 
the transgene. In a transgenic avian, as a consequence, different tissues may exhibit 
different levels of transgene expression. 

30 The term "cytokine" as used herein refers to any secreted polypeptide that affects the 

functions of cells and is a molecule that modulates interactions between cells in the 
immune, inflammatory or hematopoietic response. A cytokine includes, but is not limited 
to, monokines and lymphokines regardless of which cells produce them. For instance, a 
monokine is generally referred to as being produced and secreted by a mononuclear cell, 

35 such as a macrophage and/or monocyte. Many other cells however also produce monokines, 
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such as natural killer cells, fibroblasts, basophils, neutrophils; endothelial cells, brain 
astrocytes, bone marrow stromal cells, epideral keratinocytes and B-lymphocytes. 
Lymphokines are generally referred to as being produced by lymphocyte cells. Examples of 
cytokines include, but are not limited to, Interleukin-1 (IL-1), Interleukin-6 (IL-6), 
5 Interleukin-8 (EL-8), Tumor Necrosis Factor-alpha (TNF-alpha) and Tumor Necrosis Factor 
beta (TNF-beta). 

The term "antibody" as used herein refers to polyclonal and monoclonal antibodies 
and fragments thereof, and immunologic binding equivalents thereof. The term "antibody 
refers to a homogeneous molecular entity, or a mixture such as a polyclonal serum product 
10 made up of a plurality of different molecular entities, and may further comprise any 

modified or derivatised variant thereof that retains the ability to specifically bind an epitope. 
A monoclonal antibody is capable of selectively binding to a target antigen or epitope. 
Antibodies may include, but are not limited to polyclonal antibodies, monoclonal antibodies 
(mAbs), humanized or chimeric antibodies, camelized single chain antibodies (scFvs), Fab 
15 fragments, F(ab') 2 fragments, disulfide-linked Fvs (sdFv) fragments produced by a Fab 
expression library, anti-idiotypic (anti-Id) antibodies, intrabodies, synthetic antibodies, and 
epitope-binding fragments of any of the above. 

The term "immunoglobulin polypeptide" as used herein refers to a polypeptide 
derived from a constituent polypeptide of an immunoglobulin. An "immunoglobulin 
20 polypeptide" may be, but is not limited to, an immunoglobulin (preferably an antibody) 
heavy or light chain and may include a variable region, a diversity region, joining region and 
a constant region or any combination, variant or truncated form thereof. The term 
_iunoglobulin polypeptides" further includes single-chain antibodies comprised of, but 
not limited to, an immunoglobulin heavy chain variable region, an immunoglobulin light 
25 cham variable region and optionally a peptide linker. 

The term "male germ cells" as used herein refers to spermatozoa (i.e., male gametes) 
and developmental precursors thereof. In fetal development, primordial germ cells are 
thought to arise from the embryonic ectoderm, and are first seen in the epithelium of the 
. endodermal yolk sac at the E8 stage. From there Ihey migrate through the hindgut 
30 endoderm to the genital ridges. In the sexually mature male vertebrate animal, there are 
several types of cells that are precursors of spermatozoa, and which can be genetically 
modified, deluding the primitive spermatogonia! stem cells, known as AO/As, which 
differentiate into type B spermatogonia. The latter further differentiate to form primary 
spermatocytes, and enter a prolonged meiotic prophase during which homologous 
35 chromosomes pair and recombine. Useful precursor cells at several 
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morphological/developmental stages are also distinguishable: prelcptotene spermatocytes, 
leptotene spermatocytes, zygotene spermatocytes, pachytene spermatocytes, secondary, 
spermatocytes, and the haploid spermatids. The latter undergo further morphological 
changes during spermatogenesis, including the reshaping of their nucleus, the formation of 
5 aerosome, and assembly of the tail. The final changes in the spermatozoon (i.e., male 
gamete) take place in the genital tract of the female, prior to fertilization. 

The terms "ovum" and "oocyte" are used interchangeably herein. Although only 
one ovum matures at a time, an animal is bom with a finite number of ova. In avian 
species, such as a chicken, ovulation, which is the shedding of an egg from the ovarian- 
10 follicle, occurs when the brain's pituitary gland releases a luteinizing hormone. Mature 
follicles form a stalk or pedicle of connective tissue and smooth muscle. Immediately after 
ovulation the follicle becomes a thin-walled sac, the post-ovulatory follicle. The mature 
ovum erupts from its sac and starts its journey through the oviduct. Eventually, the ovum 
enters the infundibulum where fertilization occurs. Fertilization must take place within 15 
15 minutes of ovulation, before the ovum becomes covered by albumen. During fertilization, 
sperm (avians have polyspennic fertilization) penetrate the blastodisc. When the sperm 
lodges within this germinal disk, an embryo begins to form as a "blastoderm" or "zygote." 

The term "donor cell" is used herein to describe the source of the nuclear structure 
that is transplanted to the recipient enucleated pytoplast. All cells of normal karyotype, 
20 including embryonic, fetal, and adult somatic cells, preferably in a quiescent state, may be 
nuclear donors. The use of non-quiescent cells as nuclear donors has been described by 
Cibeffi, et al, 1998, Science 280: 1256-8. 

This application uses gene nomenclature accepted by the Cucurbit Genetics 
Cooperative as it appears in the Cucurbit Genetics Cooperative Report, 1995, 18:85; herein 
25 incorporated by reference in its entirety. Using this gene nomenclature, genes are 

symbolized by italicized Roman letters. If a mutant gene is recessive to the normal type, 
then the symbol and name of the mutant gene appear in italicized lower case letters. 

3.2 Abbreviations 

30 Abbreviations used in the present specification include the following: aa, amino 

acid(s); bp, base pair(s); cDNA, DNA complementary to RNA; nt, nucleotides); SSC, 
sodium chloride-sodium citrate; MAR, matrix attachment region; DMSO, dimethyl 
sulfoxide; TPLSM, two photon laser scanning microscopy; REMI, restriction enzyme 
mediated integration; mAb, monoclonal antibody, WEFs, whole embryo fibroblasts. 

35 
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FIGS. 1A-E illustrate the nucleotide sequence (SEQ ID NO: 6) comprising the 
chicken lysozyme gene expression control region (SEQ ID NO: 7), the nucleotide sequence 
encoding the chicken expression optimized human interferon a2b (IFNMAGMAX; SEQ ID 
5 NO: 5) and a SV40 polyadenylation signal sequence (SEQ ID NO: 8). 

FIG. 2 illustrates the nucleotide sequence SEQ ID NO: 5 encoding the chicken 
expression optimized human interferon o2b (IFNMAGMAX). 

10 FIGS. 3A-E illustrate the nucleotide sequence SEQ ID NO: 7 encoding the chicken 

lysozyme gene expression control region. 

JIG. 4 illustrates the nucleotide sequence SEQ ID NO: 8 encoding the SV40 
polyadenylation signal sequence. 

15 

FIGS. 5 A-C illustrate the nucleotide sequence SEQ ID NO: 9 encoding the chicken 
lysozyme 3* domain. 

JIGS. 6A-J illustrate the nucleotide sequence SEQ ID NO: 10 encoding the 
20 lysozyme gene expression control region (SEQ ID NO: 7) linked to the nucleic acid insert 
SEQ ID NO: 5 encoding the chicken expression-optimized human interferon a2b 
(IFNMAGMAX) and the chicken lysozyme 3' domain SEQ ID NO: 9. 

FIG. 7 illustrates the results of the PCR analysis of chick blood DNA. Lanes 4 and 5 
25 and lanes 1 1 and 12 contain PCR products from blood DNA collected from bird #8305. 

FIG. 8 illustrates the results of ELBA for human IFN a2b in transgenic hen serum. 
8307 and AA59 are serum samples collected from negative control birds. Numbers on top 
of the bars represent the number of days after hatching that the serum was collected. 

30 

FIG. 9 illustrates the results of ELBA for human IFN o2b in transgenic hen egg 
white. Three eggs from each hen were assayed. 

FIG. 10 illustrates the results of SDS-PAGE analysis of human IFN-a2b purified 
35 from the pooled egg whites obtained from transgenic chicken AVI-029. 1, molecular 
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weight markers; 2, transfeiriri/avidin markers; 3, ovalbumin/lysozyme markers; 4, 
ovoglobulins; 5, pooled egg white; 6, solubilized egg white; 7, cation exchange Pool #1; 8, 
cation exchange Pool #2; 9, HIC pool. 

. * 

5 FIG. 1 1 illustrates the results of a Western blot analysis of the protein contents of 

fractions from the purification of human IFN-a2b purified from the pooled egg whites 
obtained from transgenic chicken AVI-029. 1, HIC pool (artifact); 2, HIC pool; 3, cation 
exchange Pool #2; 4, cation exchange Pool #1; 5, solubilized egg white; 6, pooled egg 
white; 7, ovoglobulins; 8, ovalbumin/lysozyme markers; 9, transferrin/avidin markers; 10, 

1 0 molecular weight markers. 

FIG. 12 illustrates the glycosylation analysis of IFN-a2b purified from the pooled 
egg whites obtained from transgenic chicken AVI-029. 

15 FIG. 13 compares the identities and relative proportions of glycosylated side-chains 

of human and transgenic chicken human IFN-a2b. 

FIG. 14 illustrates the nucleic acid sequence SEQ ID NO: 11 of the combinatorial 
promoter MDOT. 

20 

FIGS. 15A-B illustrate the oligonucleotides and primers (SEQ ID NOS: 17-34) used 
in the formation of the chicken codon optimized human interferon a2b-encoding nucleic 
acid. 

25 FIG. 16 illustrates the levels of expression of human <x2b in eggs as determined by 

ELBA. 

FIG. 17 illustrates the bioactivity versus the mass of human interferon a2b in G 2 hen 
egg whites. 

30 

FIG. 18 illustrates interferon serum levels in chicks producing human interferon 

a2b. 

FIG. 19 illustrates the presence of a pLNHX-MD OT-IFN transgene in chicks. 
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FIG- 20 illustrates the presence of a pLNHX-MDOT-IFN transgene in chicks. 

FIG. 21 illustrates the production of human interferon hy quail oviduct cells 
transfected with pAVIJCR-Al 15.93.12. 

5 

FIG. 22 illustrates the primers (SEQ ID NOS: 38-41) used in the synthesis of the 
MDOT promoter. 

FIG. 23 illustrates the induction of human interferon <x2b by hormonally treated 
10 transfected cells. 

5. DETAILED DESCRIPTION OF THE INVENTION 
The present invention relates to methods of introducing nucleic acids into avian 
embryonic cells to produce a transgenic chicken, or other avian species, carrying the 
1 5 transgene in the genetic material in all or most of its tissue, including germ-line tissue. The 
methods and vectors of the present invention further generate transgenic avians that express 
heterologous genes in the serum of the avian and/or are deposited into an avian egg, 
preferably in the egg white. Vectors containing promoters that direct high level of 
expression of the heterologous protein in the avian, particularly in the magnum for 
20 deposition into the avian egg are provided. Additional regulatory elements, such as MAR's, 
IRES's, enhancers, polyadenlyation signals, etc., may be included in the vectors of the 
invention to improve expression and efficiency. 

Using the methods of the invention, transgenic avians that express significant 
quantities of useful heterologous proteins, e.g., therapeutic and diagnostic proteins, 
25 including immunoglobulins, industrially useful proteins and other biologies etc. in the avian 
egg white are produced. The heterologous protein can then be readily purified from the 
avian egg. The methods of the invention provide improved efficiencies of transgenesis, 
transmission of the transgene and/or level of heterologous protein expression. 

The transgenic avians of the invention are most preferably generated using 
30 cytoplasmic microinjection of nucleic acid into avian embryonic cells. Other methods 
contemplated by the invention include sperm-mediated transgenesis, nuclear transfer and 
injection or infection with a retroviral vector. Once the nucleic acid has been introduced 
into the embryo (or ovum which is then fertilized in vitro), the embryo is preferably 
returned to the avian using ovum transfer or, alternatively, is cultured ex vivo. 

35 
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5.1 METHODS OF TRANS GENESIS 

5.1.1 CYTOPLASMIC INJECTION 

The present invention provides methods of introducing nucleic acids containing a 

5 transgene, preferably, nucleic acid vectors of the invention as described in Section 5.2, 
infra, into an embryonic avian cell or an avian ovum by microinjection into the cell. In 
preferred embodiments, the nucleic acid is introduced by microinjection into the cytoplasm 
of the cell; however, in other embodiments of the invention, the nucleic acid is introduced 
into a nucleus or pronucleus, or is deposited in the perinuclear space. 

10 In the method of the present invention, fertilized ova, and preferably stage I 

embryos, are isolated from euthanized hens between forty-five minutes and four hours after 
oviposition of the previous egg. It is, however, contemplated that the methods of the 
present invention maybe applied to recipient cells of other stages of embryonic 
development such as stage I-X, as described by Eyal-Giladi and Kochav (1976, Dev. Biol. 

1 5 49:321-337). Alternatively, eggs may be isolated from hens whose oviducts have been 
fistulated as described by Gilbert and Woodgush,1963, J. ofReprod. and Fertility 5: 451- 
453 and Pander et al, 1989, Br. Poult Sci. 30: 953-7; incorporated herein in their entireties. 
Also, unfertilized eggs can be injected by in-vitro fertilization performed by any method 
knowii in the art, for example, but not limited to, the method of Tanaka et al., 1 994, /. 

20 Reproa\ Fertility 100:447-449 (the content of which is incorporated herein in its entirety). 

In particular, microinjection into the germinal disk can be accomplished as described 
in Example 6.1, infra. Briefly, once the fertilized ovum or embryo has been obtained, the 
albumen capsule is optionally removed and the ovum placed in a dish with the germinal 
disk facing upwards. Remnants of the albumen capsule may be removed from over the 

25 germinal disk if necessary and/or desired. Phosphate buffered saline (PBS) or any other 
appropriate physiological solution may be added to the dish to prevent drying of the ovum. 

Preferably, prior to microinjection, the surface of the embryo is visualized using a 
lateral imaging system described previously (International Patent Publication WO 
02/064,727), this system allows precise imaging of the injection site and facilitates accurate 

30 needle placement and injection within the germinal disk of the recipient embryo. 

hi one embodiment, allowing the visualization of the embryo's pronuclear or 
nuclear structures, a dye such as MrroTRACKER® (300 nM, Molecular Probes catalog 
number M-75 10), can be added to the cylinder. Other dyes, such as D API (4", 6 n - 
diamidino-2-phenylindole hydrochloride), HOECHST® 33342 (bis-benzunide), or Syto 59, 
35 can also be used in methods of the invention. Visualization generally is performed after 
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approximately 20 minutes of incubation. Imaging using the MlTOTRACKER® dye shows 
intense labeling of the region around the nucleus while the nucleus itself does not take up 
the dye. This allows localization of the embryo's nuclear structures for injection while not 
causing excessive damage to its structure since the content of the pronuclei are not labeled 
and therefore are not bleached during imaging. The nucleic acid solution (generally 1-100 
nanoliters) is then injected into the cytoplasm or, alternatively, into the pronucleus or 
perinuclear space. 

Any suitable microinjection assembly and methods for microinjecting and 
reimplanting avian eggs are contemplated as useful in the method of cytoplasmic injection 

10 of the present invention. A particularly suitable apparatus and method for use in the present 
invention is fully described in U.S. Patent Application No: 09/919,143 by Christmann and 
PCT Publication WO 02/064727, incorporated herein by reference in their entireties. The 
microscope/micromanipulation unit maybe an IM-16 microinjector and a MM-188NE 
micromanipulator, both from MKON®/NARISfflGE, adapted to an upright Nikon Eclipse 

15 E800 microscope adapted to operate under both transmitted and reflected light conditions. 
This unique configuration allows the loading of a DNA solution into a micropipette while 
observing the pipette with a dry or water immersion lenses under diascopic illumination or 
transmitted light. Pipette loading is followed by the prompt localization and positioning of 
the germinal disk under the microscope and subsequent guided injection of DNA solution 

20 into the germinal disk using dry or water-immersion lenses under fiber optic, as well as 
episcopic, iUumination (through the objectives and onto the embryo surface). 

In certain embodiments, the microinjected cell will also be subjected to 
microelectroporation. The application of electrical current, e.g., microelectroporation, 
enhances the uptake of exogenous DNA fragments by cultured cells and the uptake of 

25 nucleic acids in the cytoplasm of a cell into the nucleus. Enhancement of nuclear uptake of 
the heterologous DNA will promote earlier chromosomal integration of the exogenous 
DNA molecules, thus reducing the degree of genetic mosaicism observed in transgenic 
avian founders. 

Accordingly, in specific embodiments, a sample of nucleic acid will be 
30 microinjected using the methods described immediately above, and then, delivered to a 
recipient cell nucleus by microelectroporation. In a system suitable for use in 
microelectroporating early stage avian cells, a cathode will be located within the lumen of 
the DNA delivery micropipette. Alternatively, the cathode electrode may be located on the 
exterior surface of the micropipette. For either option, the electrode is situated close or 
35 adjacent to the exit orifice of me pipette so that the electrode and the micropipette may be 
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introduced into the recipient cell together. Alternatively, the micropipette will be 
introduced into the cytoplasm and used to guide a cathode to make electrical contact with 

the cytoplasm of the targeted cell. 

In one .arrangement of the electrodes of the microelectroporation system, the anode 
5 is located on the micropipette and, therefore, will enter the cell or cells with the 

micropipette and the cathode. In another arrangement, an anode is in electrical contact with 
the solution that surrounds the targeted recipient early stage avian cell. In yet another 
version, the anode is individually positioned within the cytoplasm, or the nucleus, of the 
recipient cell. The anode and cathode are electrically connected to an electrical pulse 
10 generator capable of delivering a timed electrical pulse to the electrodes. One suitable 
apparatus for generating a timed electrical pulse according to the present invention is a 
Kation Scientific Iontaphorsis pump BAB-500 or ECM 830 manufactured by BTX®. After 
microinjection of the nucleic acid, the recipient cell will be pulsed at least once with about 
0.1 to about 20.0 microamps for about 0.1 to about 60 sees. 
1 5 After injection and, optionally, microelectroporation, the embryo is allowed to 

proceed through the natural in vivo cycle of albumen deposition and hard-shell formation. 
In preferred embodiments, the embryo is surgically transferred into the infundibulum of a 
recipient hen, where it is allowed to move into the infundibulum and into the anterior 
magnum by gravity feed, such that the recipient hen produces a hard shell egg that is 
20 incubated to produce a transgenic chick See, e.g., Olsen and Neher, 1948, /. Exp. Zoo 109: 
355-366, which is incorporated by reference in its entirety. The transgenic embryo is then 
laid as a hard-shell egg and may be incubated to hatch a transgenic chick. In an alternate 
embodiment of the present invention, the injected embryo is transferred into the oviduct of a 
recipient hen, a soft-shell egg is collected between 12 and 24 hours after ovum transfer by 
25 injecting the hen with sufficient oxytocin to induce ovipositioning. The soft shell egg can 
subsequently be incubated, and a chick hatched, using an in-vitro culture system as, for 
example, that described by Perry in U.S. Patent No. 5,01 1,780 (the contents of which is 
incorporated herein in its entirety). In either case, the hatched chick may be allowed to 
attain sexual maturity whereupon it can be used, for example, to breed new generations of 
30 heterozygous or homozygous transgenic progeny. Sexually mature female transgenic avians 
are particularly useful for the expression of a heterologous nucleic acid to yield a 
heterologous polypeptide in the white of an egg. 

The hatched chick can then be tested for presence of the transgene and/or expression 
of the heterologous protein encoded by the transgene using methods well known in the art. 
35 in a particular embodiment, blood cells of the hatched chick are screened using methods 



-24- 



WO 03/025146 PCT AJS02/29878 

disclosed in United States Patent No. 6,423,488, issued July 3, 2002, which- is hereby 
incorporated by reference in its entirety. 

5.1.2 TRANSGENESIS OF BLASTODERMAL CELLS 
5 In alternative embodiments, a transgene can be introduced into avian embryonic 

blastodermal cells, to produce a transgenic chicken, or other avian species, that carries the 
transgene in the genetic material of its germ-line tissue. The methods and vectors of the 
present invention further generate transgenic avians capable of expressing heterologous 
genes in the serum of the avian and /or deposited in an avian egg. The blastodermal cells 
10 are typically stage VE-XSL cells, or the equivalent thereof, and preferably are near stage X. 
The cells useful in the present invention include embryonic germ (EG) cells, embryonic 
stem (ES) cells & primordial germ cells (PGCs). The embryonic blastodermal cells may be 
isolated freshly, maintained in culture, or reside within an embryo. 

A variety of vectors useful in carrying out the methods of the present invention are 
15 described herein, in Section 5.2 infra. These vectors may be used for stable introduction of 
an exogenous coding sequence into the genome of a bird. In alternative embodiments, the 
vectors maybe used to produce exogenous proteins in specific tissues of an avian, and in 
the oviduct in particular. In still further embodiments, the vectors are used in methods to 
produce avian eggs which contain exogenous protein. 
20 In some cases, introduction of a vector of the present invention into the embryonic 

blastodermal cells is performed with embryonic blastodermal cells that are either freshly 
isolated or in culture. The transgenic cells are then typically injected into the subgerminal 
cavity beneath a recipient blastoderm in an egg. In some cases, however, the vector is 
delivered directly to the cells of a blastodermal embryo. 
25 In one embodiment of the invention, vectors used for transfecting blastodermal cells 

and generating random, stable integration into the avian genome contain a coding sequence 
and a magnum-specific promoter in operational and positional relationship to express the 
coding sequence in the tubular gland cell of the magnum of the avian oviduct The 
magnum-specific promoter may optionally be a segment of the ovalbumin promoter region 
30 which is sufficiently large to direct expression of the coding sequence in the tubular gland 
cells. Other exemplary promoters include the promoter regions of the ovalbumin, lysozyme, 
conalbumin, ovomucoid, or ovomucin genes. Alternatively, the promoter may be a promoter 
that is largely, but not entirely, specific to the magnum, such as the lysozyme promoter. 
Other suitable promoters may be artificial constructs such as a combination of nucleic acid 
35 regions derived from at least two avian gene promoters. One such embodiment of the 
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present invention is the MDOT construct comprising regions derived from the chicken 
ovomucin and ovotransferrin promoters 

Id an alternative embodiment of the invention, transgenes containing constitutive 
promoters are used, but the transgenes are engineered so that expression of the transgene 

5 effectively becomes magnum-specific. Thus, a method for producing an exogenous protein 
in an avian oviduct provided by the present invention involves generating a transgenic avian 
that bears two transgenes in its tubular gland cells. One transgene comprises a first coding 
sequence operably linked to a constitutive promoter. The second transgene comprises a 
second coding sequence that is operably linked to a magnum-specific promoter, where 

10 expression of the first coding sequence is either directly or indirectly dependent upon the 
cellular presence of the protein expressed by the second coding sequence. 

Optionally, site-specific recombination systems, such as the Cre-/oxP or FLP-FRT 
systems, are utilized to implement the magnum-specific activation of an engineered 
constitutive promoter. In one embodiment, the first transgene contains an FRT-bounded 

15 blocking sequence which blocks expression of the first coding sequence in the absence of 
FTP, and the second coding sequence encodes FTP. In another embodiment, the first 
transgene contains a tacP-bounded blocking sequence which blocks expression of the first 
coding sequence in the absence of the Cre enzyme, and the second coding sequence encodes 
Cre. The foxP-bounded blocking sequence may be positioned in the 5' untranslated region 

20 of the first coding sequence and the foxP-bounded sequence may optionally contain an open 
reading frame. 

For instance, in one embodiment of the invention, magnum-specific expression is 
conferred on a constitutive transgene, by linking a cytomegalovirus (CMV) promoter to the 
coding sequence of the protein to be secreted (CDS). The 5' untranslated region (UTR) of 

25 the coding sequence contains a teP-bounded blocking sequence. The foxP-bounded 

blocking sequence contains two laxP sites, between which is a start codon (ATG) followed 
by a stop codon, creating a short, nonsense open reading frame (ORF). Note that the loxP 
sequence contains two start codons in the same orientation. Therefore, to prevent them 
from interfering with translation of the coding sequence after loxP excision, the loxP sites 

30 must be orientated such that the ATGs are in the opposite strand. 

In the absence of Cre enzyme, the cytomegalovirus promoter drives expression of a 
small open reading frame (ORF). Ribosomes will initiate at the first ATG, the start codon 
of the ORF, then terminate without being able to reinitiate translation at the start codon of 
the coding sequence. To be certain that the coding sequence is not translated, the first ATG 

35 is out of frame with the coding sequence's ATG. If the Cre enzyme is expressed in cells 
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containing the CMV-cDNA transgene, the Cre enzyme will recombine the loxP sites, 
excising the intervening ORF. Translation will begin at the start codon of the coding 
sequence, resulting in synthesis of the desired protein. 

To make this system tissue specific, the Cre enzyme is expressed under the control 
5 of a tissue-specific promoter, such as the magnum-specific ovalbumin promoter, in the same 
cell as the CMV-ZaxP-coding sequence transgene. Although a truncated ovalbumin 
promoter may be fairly weak, it is still tissue-specific and will express sufficient amounts of 
the Cre enzyme to induce efficient excision of the interfering ORF. In fact, low levels of 
recombinase should allow higher expression of the recombinant protein since it does not 
10 compete against coding sequence transcripts for translation machinery. 

Alternate methods of blocking translation of the coding sequence include inserting a 
transcription termination signal and/or a splicing signal between the lox? sites. These can 
be inserted along with the blocking ORF or alone. In another embodiment of the invention, 
a stop codon can be inserted between the lox? sites in the signal peptide of the coding 
15 sequence. Before recombinase is expressed, the peptide terminates before the coding 
sequence. After recombinase is expressed (under the direction of a tissue specific 
promoter), the stop codon is excised, allowing translation of the coding sequence. The loxP 
site and coding sequence are juxtaposed such that they are in frame and the loxP stop 
codons are out of frame. Since signal peptides are able to .accept additional sequence 
20 (Brown et aL, Mol Gen. Genet 197:351-7 (1984)), insertion of loxP or other recombinase 
target sequences (i.e. FRT) is unlikely to interfere with secretion of the desired coding 
sequence. In one expression vector, the loxP site is present in the signal peptide such that 
the amino acids encoded by loxP are not present in the mature, secreted protein. Before Cre 
enzyme is expressed, translation terminates at the stop codon, preventing expression of p- 
25 lactamase. After recombinase is expressed (only in magnum cells), the lox? sites recombine 
and excise the first stop codon. Therefore, p-lactamase is expressed selectively only in 
magnum cells. 

In the aforementioned embodiments, the blocking ORF can be any peptide that is 
not harmful to chickens. The blocking ORF can also be a gene that is useful for production 
30 of the ALV-transduction particles and/or transgenic birds. In one embodiment, the blocking 

ORF is a marker gene. 

For instance, the blocking ORF could be the neomycin resistance gene, which is 
required for production of transduction particles. Once the transgene is integrated into the 
chicken genome, the neomycin resistance gene is not required and can be excised. 

35 
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Alternatively. 6-lactamase can be used as the blocking ORF as it is an useful marker 
for production of transgenic birds. (For specific examples of the use of p-lactamase as a 
marker in transgenic birds, see Example 22, below.) As an example, the blocking ORF is 
replaced by ^-lactamase and the downstream coding sequence now encodes a secreted 
5 biopharmaceutical. P-Lactamase will be expressed in blood and other tissues; it will not be 
expressed in the magnum after magnum-specific expression of Cre and recombination- 
mediated excision of ^-lactamase, allowing expression of the desired protein. 

The Cre and lox? transgenes could be inserted into the chicken genome via mediated 
transgenesis either simultaneously or separately. Any method of transgenesis that results in 

10 stable integration into the chicken genome is suitable including, but not limited to, viral 
integration and sperm-mediated integration. Both the ovalbumin promoter-recombinase and 
CMV-/orP-CDS transgenes could be placed simultaneously into chickens. However, the 
efficiencies of transgenesis are low and therefore the efficiency of getting both transgenes 
into the chicken genome simultaneously is low. In an alternative and preferred method, one 

1 5 flock is produced that carries the magnum-specific promoter/recombinase transgene and a 
second is produced that carries the CMV-/oxP-CDS transgene. The flocks would then be 
crossed to each other. Hens resulting from this outbreeding will express the coding 
sequence and only in their magnum. 

As mentioned above, the vectors produced according to th-i methods of the invention 

20 may optionally beprovided with a3'UTR containing a pqlyadenylation site to confer, 
stability to the RNA produced. In a preferred embodiment, the 3' UTR may be that of the 
exogenous gene, or selected fiom the group consisting of the ovalbumin, tysozyme, or SV40 
late region. However, the ovalbumin 3' UTR is not suitable in a PMGI vector that is to be 
inserted into the endogenous ovalbumin gene because the addition of ovalbumin sequences 

25 to the PMGI vector will interfere with proper targeting. 



5.13 VIRAL HOST CELL TRANSFORMATION 
In another embodiment, a method of introducing a nucleic acid comprising a nucleic 
acid sequence encoding one of the subject polypeptides and the associated gene expression 
30 control regions into a cell is using of a viral vector containing nucleic acid, e.g. a cDNA, 
encoding the gene product Infection of cells with a viral vector has the advantage that a 
large proportion of the targeted cells can receive the nucleic acid. Additionally, molecules 
encoded within the viral vector, e.g., by a cDNA contained in the viral vector, are expressed 
efficiently in cells that have taken up viral vector nucleic acid 
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Retrovirus vectors and adeno-associated virus vectors are generally understood to be 
the recombinant gene delivery system of choice for the transfer of heterologous genes in 
vivo. These vectors provide efficient delivery of genes into cells, and the transferred nucleic 
acids are stably integrated into the chromosomal DNA of the host. Recombinant retrovirus 

5 can be constructed wherein the retroviral coding sequences (gag, pol 3 env) have been 

replaced by nucleic acid encoding a polypeptide, thereby rendering the retrovirus replication 
defective. Protocols for producing recombinant retroviruses and for infecting cells in vitro 
or in vivo with such viruses can be found in Current Protocols in Molecular Biology, 
Ausubel et al, (1989) (eds.) Greene Publishing Associates, Sections 9.10-9.14 and other 

10 standard laboratory manuals. Examples of suitable retroviruses include pLJ, pZIP, pWE 
and pEM which are well known to those skilled in the art. Examples of suitable packaging 
virus lines for preparing both ecotropic and amphotropic retroviral systems include psiCrip, 

. psiCre, psi2 and psiAm. 

Furthermore, it is possible to limit the infection spectrum of retroviruses and 

15 consequently of retroviral-based vectors, by modifying the viral packaging proteins on the 
surface of the viral particle (see, for example PCT publications WO 93/25234, WO , 
94/06920, and WO 94/1 1524). For instance, strategies for the modification of the infection 
spectrum of retroviral vectors include coupling antibodies specific for cell surface antigens 

, to the viral env protein (Roux et a/., 1989, Proc. Natl Acad. Sci. 86: 9079-90E3; Julan et 

20 al 9 J. Gen. Virol 73: 3251-3255 (1992); and Goud et al, 1993, Virology 163: 251-254 ); or 
coupling cell surface ligands to the viral env proteins (Neda et al , 1991, J. Biol Chenu 266, 
14143-14146), and which are incorporated herein by reference in their entireties. Coupling 
can be in the form of the chemical cross-linking with a protein or other variety (e.g. lactose 
to convert the env protein to an asialoglycoprotein), as well as by generating fusion proteins 

25 ( e .g. single-chain antibody/env fusion proteins). This technique, while useful to limit or 
otherwise direct the infection to certain tissue types, can also be used to convert an 
ecotropic vector into an amphotropic vector. Moreover, use of retroviral gene delivery can 
be further enhanced by the use of tissue- or cell-specific transcriptional regulatory sequences 
that control expression of the nucleic acid encoding an irnmunoglobulin polypeptide of the 

30 retroviral vector. 

One retrovirus for randomly introducing a transgene into the avian genome is the 
replication-deficient ALV retrovirus. To produce an appropriate ALV retroviral vector, a 
pNLB vector is modified by inserting a region of the ovalbumin promoter and one or more 
exogenous genes between the 5' and 3' long terminal repeats (LTRs) of the retrovirus 

35 genome. Any coding sequence placed downstream of the ovalbumin promoter will be 
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expressed at high levels and only in the tubular gland cells of the oviduct magnum because 
the ovalbumin promoter drives the high level of expression of the ovalbumin protein and is 
only active in the oviduct tubular gland cells. While a 7.4 kb ovalbumin promoter has been 
found to produce the most active construct when assayed in cultured oviduct tubular gland 

5 cells, the ovalbumin promoter must be shortened for use in the retroviral vector. In a 
preferred embodiment, the retroviral vector comprises a 1.4 kb segment of the ovalbumin 
promoter; a 0.88 kb segment would also suffice. 

Any of the vectors of the present invention may also optionally include a coding 
sequence encoding a signal peptide that will direct secretion of the protein expressed by the 

10 vector's coding sequence from the tubular gland cells of the oviduct. This aspect of the 
invention effectively broadens the spectrum of exogenous proteins that may be deposited in 
avian eggs using the methods of the invention. Where an exogenous protein would not 
otherwise be secreted, the vector bearing the coding sequence is modified to comprise a 
DNA sequence comprising about 60 bp encoding a signal peptide from the lysozyme gene. 

15 The DNA sequence encoding the signal peptide is inserted in the vector such that it is 
located at the N4errninus of the protein encoded by the cDNA. 

Construction of one vector is reported in Example 19, below, ^lactamase may be 
expressed from the CMV promoter and utilizes a poly adenylation signal (pA) in the 3' long 
terminal repeat (LTR). p-Lactamase has a natural signal peptide; thus, it is found in blood 

20 and in egg white. 

Avian embryos have been successfully transduced with pNLB-CMV-BL 
transduction particles (see Examples 1 1 and 12, below). The egg whites of eggs from the 
resulting stably transduced hens were found to contain up to 20 mg of secreted, active (3- 
lactamase per egg (see Examples 13 and 14, below). 

25 Another viral gene delivery system useful in the present invention utilizes 

adenovirus-derived vectors. The genome of an adenovirus can be manipulated such that it 
encodes a gene product of interest, but is inactivated in terms of its ability to replicate in a 
normal lytic viral life cycle (see, for example, Berkner et al. BioTechniques 6, 616 (1988); 
■ Rosenfeld etal Science 252, 43 1434 (1991); aniRosenfeld etol. Cell 68, 143-155(1992)), 

30 incorporated herein by reference in their entireties. Suitable adenoviral vectors derived 
from the adenovirus strain Ad type 5 dl324 or other strains of adenovirus (e.g., Ad2, Ad3, 
Ad7 etc.) are well known to those skilled in the art. The virus particle is relatively stable 
and amenable to purification and concentration, and as above, can be modified so as to 
affect the spectrum of infectivity. Additionally, introduced adenoviral DNA (and foreign 

35 DNA contained therein) is not integrated into the genome of a host cell but remains 
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enisomal. therebv avoiding potential problems that can occur as a result of insertional 
mutagenesis in situations where introduced DNA becomes integrated into the host genome 
(e.g., retroviral DNA). Most replication-defective adenoviral vectors currently in use and 
therefore favored by the present invention are deleted for all or parts of the viral El and E3 
5 genes but retain as much as 80% of the adenoviral genetic material (see, for example, Jones 
et aL, (1979) Cell 16, 683; Berkner et aL, supra; and Graham et aL, in Methods in 
Molecular Biology, E. /. Murray, (1991) Ed. (Humana, Clifton, NJ.) vol. 7. pp. 109-127), 
and which are incorporated herein by reference in their entireties. Expression of an inserted 
nucleic acid encoding a polypeptide such as IFNMAGMAX, an immunoglobulin, EPO, 
10 GM-CSF, can be under control of, for example, the lysozyme promoter, the ovalbumin 
promoter, artificial promoter construct sequences and the like. 

Yet another viral vector system useful for delivery of, for example, the subject 
nucleic acid encoding an immunoglobulin polypeptide, is the adeno-associated virus 
(AAV). Vectors containing as little as 300 base pairs of AAV can be packaged and can 
. 15 integrate. Space for heterologous DNA is limited to about 4.5 kb. An AAV vector such as 
that described in Tratschin et aL, Mol. Cell. Biol. 5, 3251-3260 (1985) can be used to 
introduce DNA into cells. A variety of nucleic acids have been introduced into different 
cell types using AAV vectors (see, for example, Hermonat et aL, Proc. Natl. Acad. Sci. 81, 
6466-6470 (19*4); Tratschin et aL, Mol. Cell. Biol. 4, 2072-2081 (1985); Wondisford et aL, 
20 Mol. Endocrinol. 2, 32-39 (1988); Tratschin et aL, L ViroL 51, 611-619 (1984); and Flotte 
et aL, J. Biol. Chern. 268, 3781-3790 (1993)), incorporated herein by reference.in their 
entireties. 

Other viral vector systems that may have application in the methods according to the 
present invention have been derived from, but are not lirnited to, herpes virus, vaccinia 
25 virus, avian leucosis virus and several RNA viruses. 

5.1.4 GENERATION OF TRANSGENIC AVIAN ZYGOTES BY 
NUCLEAR TRANSFER AND TPLSM 

In another embodiment, transgenes may be introduced into the ovum of an animal, 
30 according to the present invention, by nuclear transfer via two-photon visualization and 
ablation, wherein the nuclear donor contains a desired heterologous DNA sequence in its 
genome. One of ordinary skill in the art will be able to readily adapt conventional methods 
to insert the desired transgene into the genome of the nuclear donor prior to injection of the 
nuclear donor into the recipient cytoplast, or prior to fusion of the nuclear donor cell with 
35 the recipient cell. For example, a vector that contains one or more transgene(s) encoding at 
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least one polypeptide chain of an antibody, may be delivered into the nuclear donor cell 
through the use of a delivery vehicle. The transgene is then transferred along with the 
nuclear donor into the recipient ovum. Following zygote reconstruction, the ovum is 
transferred into the reproductive tract of a recipient hen. In one embodiment of the present 
5 invention, the ovum is transferred into the infundibulum of the recipient hen. After 
reconstruction, the embryo containing the transgene develops inside the recipient hen and 
travels through the oviduct thereof where it is encapsulated by natural egg white proteins 
and a natural egg shell. The egg is laid and can be incubated and hatched to produce a 
transgenic chick. The resulting transgenic chick will carry one or more desired transgene(s) 
10 in its germ line. Following maturation, the transgenic avian may lay eggs that contain one 
or more desired heterologous protein(s) that can be easily harvested. 

In another embodiment of the present invention, a nuclear donor cell is transfected 
with a vector construct that contains a transgene encoding at least one polypeptide chain. 
Methods for transfection of somatic cell nuclei are well known in the art and include, by 
15 way of example, the use of retroviral vectors, retrotransposons, adenoviruses, adeno- 
associated viruses, naked DNA, lipid-mediated transfection, electroporation and direct 
injection into the nucleus. Such techniques, particularly as applied to avians, are disclosed 
in Bosselman (U.S. Patent No. 5,162,215), Etches (PCT Pubtication No. WO 99/10505), 
Hodgson (U.S. Patent No. 6,027,722), Hughes (U.S. Patent No. 4,997,763), Ivarie (PCT 
20 Publication No. WO 99/19472), MacArthur (PCT Pubhcation No. WO 97/47739), Perry 
(U.S. Patent No. 5,01 1,780), Petitte (U.S. Patent Nos. 5,340,740 and 5,656,749), and 
Simkiss (PCT Pubhcation No. WO 90/11355), the disclosures of which are incorporated by 

reference herein in their entireties. 

Nuclear transfer allows the cloning of animal species, wherein individual steps are 

25 common to the procedures of embryonic, fetal and adult cell cloning. These steps include, 

. but are not limited to, preparation of a cytoplast, donor cell nucleus (nuclear donor) 
isolation and transfer to the cytoplast to produce a reconstructed embryo, optional 
reconstructed embryo culture, and embryo transfer to a synchronized host animal. 

The present invention may use this approach to nuclear transfer in animals by 

30 employing two-photon visualization. In embodiments of the invention, the recipient animal 
is an avian including, but not limited to, chickens, ducks, turkeys, quails, pheasants and 
ratites. In this method, a fertilized or unfertilized egg is removed from an animal and 
manipulated in vitro, wherein the genetic material of the egg is visualized and removed and 
the ablated nucleus replaced with a donor nucleus. Optionally, the donor nucleus may be 

35 genetically modified with, for example, a transgene encoding an immunoglobulin 
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polypeptide. Two-photon laser scanning microscopy (TPLSM) may be used to visualize the 
nuclear structures. Following visualization, the nucleus in the recipient cell, such as a 
fertilized or unfertilized egg, is removed or ablated, optionally using TPLSM. 

TPLSM is based on two-photon excited fluorescence in which two photons collide 

5 simultaneously with a fluorescent molecule. Their combined energy is absorbed by the 
fluorophore, inducing fluorescent emission that is detected by a photomultiplier tube and 
converted into a digital image. See Squirrell et al, Nature Biotechnol. 17, 763-7, (1999) 
and Piston et aU Trends Cell Biol. 9, 66-9, (1999) incorporated herein by reference in their 
entireties. TPLSM generates images of living, optically dense structures for prolonged 

10 periods of time, while not affecting their viability. TPLSM utilizes biologically innocuous 
pulsed near-infrared light, usually at a wavelength of about 700 nm to about 1000 ran, 
which is able to penetrate deep into light-scattering specimens. TPLSM may employ 
different lasers, such as a mode-locked laser, where the wavelength is fixed, or a tunable 
laser that can be tuned to wavelengths between about 700 nm and about 1000 nm, 

15 depending upon the range of emission of the dye used. For DAPI and Hoescht 33342 dyes, 
720-770 nm is preferred. New fluorophores are being produced with different ranges of 
emission and the invention is not limited to the presently available dyes and their respective 
emission ranges. 

Furthennore, lasers used in T2LSM can be grouped into femtosecond and 
20 picosecond lasers. These lasers are distinguished by their pulse duration. A femtosecond 
laser is preferred since it is particularly suitable for visualization without harming the 
specimen. 

TPLSM produces noninvasive, three-dimensional, real-time images of the optically 
dense avian egg. Visualization of the metaphase plate or pronucleus in avian eggs during 

25 nuclear transfer has been prevented by the yolk. Two-photon imaging with femtosecond 
lasers operating in the near infrared, however, allows visualization of nuclear structures 
without damaging cellular constituents. Prior to visualization, specimens may be incubated 
or injected with DNA-specific dyes such as DAPI (4', 6'-dianud^o-2-phenyiindole 
hydrochloride) or Hoescht 33342 (bis^benziinide), the albumen capsule is removed and the 

30 ovum placed in a dish with the genninal disk feeing the top. Remnants of the albumen 
capsule are removed from the top of the germinal disk. 

An aqueous solution, for example phosphate-buffered saline (PBS), is added to 
prevent drying of the ovum. A cloning cylinder is placed around the germinal disk and 
DAPI in PBS is added to the cylinder. Alternatively, a DAPI-PBS solution may be injected 

35 into the germinal disk with a glass pipette, whereupon the dye enters the nuclear structures. 
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For dve injection, removal of the albumen capsule is not necessary, whereas injection of 

w* — 

nuclei into the disk is facilitated in the absence of the capsule. 

Images of the inside of the early avian embryo can be generated through the use of 
TPLSM. Visualization may be performed after about 10 to 15 minutes of incubation or 
5 about 10 minutes after dye injection. During visualization, the germinal disk is placed 
under the microscope objective and the pronuclear structures are searched within the central 
area of the disk using relatively low laser powers of about 3-6 milliwatts. Once the 
structures are found they may be ablated by using higher laser power or mechanically 

removed, guided by TPLSM. 

1 0 Nuclear transfer also requires the destruction or enucleation of the pronucleus before 

a nuclear donor can be introduced into the oocyte cytoplast. Two-photon laser-mediated 
ablation of nuclear structures provides an alternative to microsurgery to visualize the 
pronucleus lying about 25um beneath the ovum's vitelline membrane within the germinal 
disk. Higher laser powers than those used for imaging are used for enucleation, with 

15 minimal collateral damage to the cell. The wavelength for ablation generally ranges from 
about 700 nm to 1000 nm, at about 30 to about 70 milliwatts. TPLSM and two-photon 
laser-mediated ablation are more efficient than alternative methods because they are less 
operator dependent and less invasive, which results in improved viability of the recipient 
cell. 

20 A nucleus from a cultured somatic cell (nuclear donor) may then be injected into the 

0 

enucleated recipient cytoplast by a micromanipulation unit comprising a microinjector and a 
micromaiiipulator. The donor nucleus is introduced into the germinal disk though guided 
injection using episcopic mumination (Le., light coming through the objective onto the 
sample). Alternatively, a donor cell may be fused to the recipient cell using methods well 

25 known in the art, e.g. by means of fusion-promoting chemicals, such as polyethylene glycol, 
inactivated viruses, such as Sendai virus, or electrical stimulation. The reconstructed zygote 
may then be surgically transferred to the oviduct of a recipient hen to produce a hard shell 
egg. Alternatively, the reconstructed embryo may be cultured for 24 hours and screened for 
development prior to surgical transfer. 

30 The egg can be harvested after laying and before hatching of a chick, or further 

incubated to generate a cloned chick, optionally genetically modified. The cloned chick 
may carry a transgene in all or most of its cells. After maturation, the transgenic avian may 
lay eggs that contain one or more desired, heterologous protein(s). The cloned chick may 
also be a knock-in chick expressing an alternative phenotype or capable of laying eggs 

35 
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having an heterologous protein therein. The reconstructed egg may also be cultured to term 
using the ex ovo method described by Perry et ai (supra). 

• ■ « 

5.1.5 ZYGOTE RECONSTRUCTION BY OVUM TRANSFER 

* " » 

5 Another embodiment of the invention provides for a method of producing a cloned 

animal comprising nuclear transfer in combination with ovum transfer. Two-photon 
visualization and ablation may be used to perform nuclear transfer, as described above. 
Accordingly, the replacement of the recipient cell's nucleus with the donor cell's nucleus 
results in a reconstructed zygote. Preferably, pronuclear stage eggs are used as recipient 

10 cytop lasts already activated by fertilization. Alternatively, unactivated metaphase H eggs 
may serve as recipient cytoplast and activation induced after renucleation. The ovum may 
be cultured via ovum transfer, wherein the ovum containing the reconstructed zygote is 
transferred to a recipient hen. The ovum is surgically transferred into the oviduct of the 
recipient hen shortly after oviposition. This is accomplished according to normal husbandry 

15 procedures (oviposition, incubation, and hatching; see Tanaka et aL, supra). 

Alternatively, the ovum may be cultured to stage X prior to transfer into a recipient 
hen. More specifically, reconstructed stage I embryos are cultured for 24-48 hours to stage 
X. This allows for developmental screening of the reconstructed embryo prior to surgical 
transfer. Stage I embryos are enclosed within a thick albumen capsule. In this novel 

20 procedure, the albumen capsule is removed, after which the nuclear donor is injected into 
the germinal disk. Subsequently, the capsule and germinal disk are recombined by placing 
the thick capsule in contact with the germinal disk on top of the yolk. Embryos develop to 
stage X at similar rates as those cultured with their capsules intact. At stage X, the embryo 
is transferred to the oviduct of a recipient hen. 

25 Once transferred, the embryo develops inside the recipient hen and travels through 

the oviduct of the hen where it is encapsulated by natural egg white proteins and a natural 
egg shell. The egg which contains endogenous yolk and an embryo from another hen, is 
laid and can then be incubated and hatched like a normal chick. The resulting chick may 
carry a transgene in all or most of its cells. Preferably, the transgene is at least in the 

30 oviduct cells of the recipient chick. Following maturation, the cloned avian may express a 
desired phenotype or may be able to lay eggs that contain one or more desired, heterologous 
protein(s). 
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5.1.6 SPERM-MEDIATED INTEGRATION OF HETEROLOGOUS 
TRANSGENES 

Detailed descriptions of methods of sperm-mediated transfer of nucleic acid suitable 
for use in the present invention are described in the PCT Publication WO 00/697257, 

5 incoiporated herein by reference in its entirety. The first method of incorporating 
heterologous genet^-^terial into the genome of an avian delivers a nucleic acid using 
known gene dehverysystems to male genn cells in situ in the testis of the male avian (e.g., 
by in vivo transfection or transduction). The second, in vitro, method of incorporating 
heterologous genetic material into the genome of an avian involves isolating male germ 

1 o cells ex corpora, delivering a polynucleotide thereto and then returning the transfected cells 
to the testes of a recipient male bird. 

In vivo method 

The in vivo method employs injection of the gene delivery mixture, preferably into 
1 5 the seminiferous tubules, or into the pete testis, and most preferably into the vas efferens or 
vasa efferentia, using, for example, a micropipette and a picopump delivering a precise 
measured volume under controlled amounts of pressure. A small amount of a suitable, non- 
toxic dye can be added to the gene delivery mixture (fluid) to confirm delivery and 
dissemination to the seminiferous tubules of the testis. The genetically modified germ cells 
20 differentiate in their own milieu. Progeny animals exhibiting the nucleic acid's integration 
into its germ cells (transgenic animals) are selected. The selected progeny can then be 
mated, or their sperm utilized for insemination or in vitro fertilization to produce further 
generations of transgenic progeny. 
In vitro method 

25 Male germ cells are obtained or collected from the donor male bird by any means 

known in the art such as, for example, transection of the testes. The germ cells are then 
exposed to a gene delivery mixture, preferably within several hours, or cryopreserved for 
later use. When the male genn cells are obtained from the donor vertebrate by transection 
of the testes, the cells can be incubated in an enzyme mixture known for gently breaking up 

30 the tissue matrix and releasing undamaged cells such as, for example, pancreatic trypsin, 
collagenase type I, pancreatic DNAse type I, as well as bovine serum albumin and a 
modified DMEM medium. After washing the cells, they can be placed in an incubation 
medium such as DMEM, and die like, and plated on a culture dish for genetic modification 
by exposure to a gene delivery mixture. 

35 Whether employed in the in vivo method or in vitro method, the gene delivery 

mixture, once in contact with the male germ cells, facilitates the uptake and transport of 
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heterologous genetic material into the appropriate cell location for integration into the 
genome and expression. A number of known gene delivery methods can be used for the 
uptake of nucleic acid sequences into the cell. Such methods include, but are not limited to 
viral vectors, liposomes, electroporation and Restriction Enzyme Mediated Integration 

5 (REMI) (discussed below). In both the in vivo or in vitro method, a gene delivery mixture 
typically comprises a polynucleotide encoding the desired trait or product (for example, 
immunoglobulin polypeptides) and a suitable promoter sequence such as, for example, a 
tissue-specific promoter, an IRES or the like and optionally agents that increase the uptake 
of or comprise the polynucleotide sequence, such as liposomes, retroviral vectors, 

10 adenoviral vectors, adenovirus enhanced gene delivery systems and the like, or 

combinations thereof. A reporter construct, including a generic selection marker, such as 
the gene encoding for Green Fluorescent Protein, can further be added to the gene delivery 
mixture. Targeting molecules, such as the c-kit ligand, can be added to the gene delivery 
mixture to enhance the transfer of genetic material into the male germ cell. An 

l5 immunosuppressing agent, such as cyclosporin or a corticosteroid may also be added to the 
gene delivery mixture as known in the art. 

Any of a number of commercially available gene delivery mixtures can be used, to 
which the polynucleotide encoding a desired trait or product is further admixed. The final 
. gene delivery mixture comprising the polynucleotide can then be admixed with the cells and 

20 allowed to interact for a period of between about 2 hours to about 16 hours, at a temperature 
of between about 33 °C to about 37 °C. After this period, the cells are preferably placed at a 
lower temperature of about 33 °C to about 34°C, for about 4 hours to about 20 hours, 

preferably about 16 to 18 hrs. 

Isolating and/or selecting genetically transgenic germ cells (and transgenic somatic 

25 cells, and of transgenic vertebrates) is by any suitable means, such as, but not limited to, 
physiological and/or morphological phenotypes of interest using any suitable means, such as 
biochemical, enzymatic, immunochemical, histologic, electrophysiologic, biometric or like 
methods, and analysis of cellular nucleic acids, for example the presence or absence of 
specific DNAs or RNAs of interest using conventional molecular biological techniques, 

30 including hybridization analysis, nucleic acid amplification including, but not limited to, 
polymerase chain reaction, transcription-mediated amplification, reverse transcriptase- 
mediated ligase chain reaction, and/or electrophoretic technologies. 

A preferred method of isolating or selecting male germ cell populations comprises 
obtaining specific male germ cell populations, such as spermatogonia, from a mixed 

35 population of testicular cells by extrusion of the cells from the seminiferous tubules and 
enzyme digestion. The spermatogonia, or other male germ cell populations, can be isolated 
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from a mixed cell population by methods such as the utilization of a promoter sequence that 
is specifically or selectively active in cycling male germ line stem cell populations. Suitable 
promoters include B-Myb or a specific promoter, such as the e-kit promoter region, c-raf-1 
promoter, ATM (ataxia-telangiectasia) promoter, vasa promoter, RBM (ribosome binding 
5 motif) promoter, DAZ (deleted in azoospermia) promoter, XRCC- 1 promoter, HSP 90 
(heat shock gene) promoter, cyclin Al promoter, or FRME (from Fragile X site) promoter 
and the like. A selected promoter may be linked to a reporter construct, for example, a 
construct comprising a gene encoding Green Fluorescent Protein (or EGFP), Yellow 
Fluorescent Protein, Blue Fluorescent Protein, a phycobiliprotein, such as phycoerythrin or 
10 phycocyanin, or any other protein which fluoresces under suitable wave-lengths of fight, or 
encoding a light-emitting protein, such as luciferase or apoaequorin. The unique promoter 
sequences drive the expression of the reporter construct only during specific stages of male 
geim cell development (e.g., Mailer et al, J. Biol. Chem. 276(16), 11220-28 (1999); 
Schrans-Stassen et al, Endocrinology 140, 5894-5900 (1999)) incorporated herein by 
1 5 reference in their entireties. In the case of a fluorescent reporter construct, the cells can be 
sorted with the aid of, for example, a FACS set at the appropriate wavelength(s), or they can 

be selected by chemical methods. 

Male germ cells that have the DNA modified in the desired manner are isolated or 
selected, and transferred to the testis of a suitable recipient animal. Further selection can be 

20 attempted after biopsy of one or both of the recipient male's testes, or after examination of 
the animal's ejaculate amplified by the polymerase chain reaction to confirm that the desired 
nucleic acid sequence had been incorporated. 

The genetically modified germ cells isolated or selected as described above are 
preferably transferred to a testis of a recipient male avian, preferably a chicken, that can be, 

25 but need not be, me same donor animal. Before transferring the genetically modified male 
germ cells to the recipient animal, the testes of the recipient can be depopulated of 
endogenous geim cells, thereby facilitating the colonization of the recipient testis by the 
genetically modified germ cells, by any suitable means, including by gamma irradiation, by 
chemical treatment, by means of infectious agents such as viruses, or by autoimmune 

30 depletion or by combinations thereof, preferably by a combined treatment of the vertebrate 
with an alkylating agent and gamma irradiation. 

The basic rigid architecture of the gonad should not be destroyed, nor significantly 
damaged. Disruption of tubules may lead to impaired transport of testicular sperm and 
result in infertility. Sertoli cells should not be irreversibly damaged, as they provide a base 

35 for development of the germ cells during maturation, and for preventing the host immune 
defense system from destroying grafted foreign spermatogonia. 
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In a preferred method, a cytotoxic alkylating agent, such as, but not limited to, 
bisulfan (1,4-butanediol dimethanesuiphonate), chlorambucil, cyclophosphamide, 
melphalan, or ethyl ethanesulfonic acid, is combined with gamma irradiation, to be 
administered in either sequence. The dose of the alkylating agent and the dose of gamma 
5 radiation are in an amount sufficient to substantially depopulate the testis. The alkylating 
agent can be administered by any pharmaceutical^ acceptable delivery system, including 
but not limited to, intraperitoneal, intravenous, or intramuscular injection, intravenous drip, 
implant, transdermal or transmucosal delivery systems. 

The isolated or selected genetically modified germ cells are transferred into the 
10 recipient testis by direct injection using a suitable micropipette. Support cells, such as 
Leydig or Sertoli cells, that can be unmodified or genetically modified, can be transferred to 
a recipient testis along with the modified germ cells. 

A union of male and female gametes to form a transgenic zygote is brought about by 
copulation of the male and female vertebrates of the same species, or by in vitro or in vivo 
15 artificial means. If artificial means are chosen, then incorporating into the genome a genetic 
selection marker that is expressed in male germ cells is particularly useful. 

Suitable artificial means include, but are not limited to, artificial insemination, in 
vitro fertilization (IVF) and/or other artificial reproductive technologies, such as 
intracytoplasmic sperm injection (ICSI), subzonal imemination (SUZI), or partial zona 
20 dissection (PZD). Also others, such as cloning and embryo transfer, cloning and embryo 
splitting, and the like, can be employed. 

The transgenic vertebrate progeny can, in turn, be bred by natural mating, artificial 
insemination, or by in vitro fertilization (IVF) and/or other artificial reproductive 
technologies, such as intracytoplasmic sperm injection (ICSI) and chicken intracytoplasmic 
25 sperm injection (CHICS!™), subzonal insemination (SUZI), or partial zona dissection 

(PZD), to obtain further generations of transgenic progeny. Although the genetic material is 
originally inserted solely into the germ cells of a parent animal, it will ultimately be present 
in the germ cells of future progeny and subsequent generations thereof. In addition, the 
genetic material will also be present in cells of the progeny other than germ cells, i.e., 
30 somatic cells. 

517 GENERATION OF TRANSGENIC AVIAN ZYGOTES BY 
RESTRICTION ENZYME-MEDIATED INTEGRATION (REMI) 

The REM method for stably integrating heterologous DNA into the genomic DNA 
of a recipient cell is described by Shemesh et al in PCT Publication No. WO 99/42569 and 
35 incorporated herein by reference in its entirety. This REMI method comprises in part an 
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adaptation of the REMI technique disclosed by Schiest and Petes (Proc. Nat. Acad. Sci. 
U.S.A. 88, 7585-7589 (1991)) and Kuspa and Loomis (Proc. Nat Acad. Sci. U.S.A., 89, 
8803-8807 (1992)) both incorporated herein by reference in their entireties. 

The REMI method is suitable for introducing heterologous DNA into the genome 

5 nucleic acid of sperm and sperm precursor cells, or ovum, embryonic cell, or somatic cell of 
an animal, preferably an avian, more preferably a chicken. 

The heterologous nucleic acid to be integrated into, for example, the sperm nuclear 
DNA is converted to a linear double stranded DNA possessing single-stranded cohesive 
ends by contacting the heterologous DNA with a type II restriction enzyme that upon 

10 scission, generates such ends. The nucleic acid to be cut can be a circular nucleic acid such 
as in aplasmid or a viral vector or a linear nucleic acid that possesses at least one 
recognition and cutting site outside of the genes or regulatory regions critical to the desired 
post-integration function of the nucleic acid, and no recognition and cutting sites within the 
critical regions. 

1 5 Alternatively the heterologous DNA to be integrated into the sperm nuclear DNA 

can be prepared by chemically and/or enzymatically adding cohesive ends to a linear DNA 
(see, for example Sambrook et al, Molecular Cloning: A Laboratory Manual, 3rd ed., Cold 
Spring Harbor Laboratory, Cold Spring Harbor, New York (2001) incorporated herein by 
reference in its entirety). The added cohesive ends must be able to hybridize to the cohesive 

20 ends characteristic of a nucleic acid cleaved by a type II restriction endonuclease. 

Alternatively the cohesive ends can be added by combining the methods based on type II 
restriction enzyme cutting and chemical and / or enzymatic addition. 

According to the present invention, a heterologous nucleic acid encoding at least one 
polypeptide, and the appropriate restriction enzyme can be introduced into sperm cells 

25 together or sequentially by way of; for example, electroporation, or lipofection. Preferably 
electroporation may be used, and most preferably lipofection is used. However, the present 
invention contemplates that any technique capable of transferring heterologous material into 
sperm could be used so long as the technique preserves enough of the sperm's motility and 
fertilization functions, such that the resultant sperm will be able to fertilize the appropriate 

30 oocytes. It is understood that the heterologous nucleic acid may be integrated into the 
genome of a recipient cell such as a spermatogonia! cell or a spermatogonia! precursor cell 
for subsequent transfer to an embryo or the testicular material of the recipient male animal, 
preferably a chicken. It is further understood that the heterologous nucleic acid may not be 
integrated into the genome of the recipient cell. 

35 The combination ofREMI as described in the present application, plus a relatively 

benign method of tmisferring heterologous material into a cell may result in heterologous 
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nucleic acid being stably integrated into genomic DNA of a high fraction of the treated 
sperm, while not diminishing to any great extent, the viability of the sperm or their ability to 
fertilize oocytes. Examples of suitable methods for the introduction of the genetically 
modified sperm, spermatogonial cells or precuror spermatogonial cells into a recipient 
5 avian, preferably a chicken, are as described above. 

5.1.8 BREEDING AND MAINTENANCE OF TRANSGENIC 
AVIANS 

A union of male and female gametes from transgenic birds generated by the 
10 cytoplasmically microinjected embryos, thereby forming a transgenic zygote, is brought 
about by copulation of the male and female vertebrates of the same species, or by in vitro or 
in vivo artificial means. Suitable artificial means include, but are not limited to, artificial 
insemination, in vitro fertilization (IVF) and/or other artificial reproductive technologies, 
such as intracytoplasmic sperm injection (ICSI), subzonal insemination (SUZT), or partial 
15 zona dissection (PZD). Also others, such as cloning and embryo transfer, cloning and 
" embryo splitting, and the like, can be employed. 

The transgenic avian progeny can, in turn, be bred by natural mating, artificial 
insemination, or by in vitro fertilization (IVF) and/or other artificial reproductive 
technologies, such as intracytoplasmic sperm injection (ICSI) and chicken intracytoplasmic 
20 sperm injection (CfflCSl™), subzonal insemination (SUZI), or partial zona dissection 
(PZD), to obtain further generations of transgenic progeny. 

Using the methods of the invention for producing transgenic avians, particularly 
methods using vectors that are not derived from eukaryotic viruses, and, preferably, the 
.methods of cytoplasmic micro-injection described herein, the level of mosaicism of the 
25 transgene (percentage of cells containing the transgene) in avians hatched from 

microinjected embryos (i.e, the Gqs) is greater than 5%, 10%, 25%, 50%, 75% or 90%, or is 
the equivalent of one copy per one genome, two genomes, five genomes, seven genomes or 
eight genomes, as determined by any number of techniques known in the art and described 
infra. In additional particular embodiments, the percentage of GqS that transmit the 
30 transgene to progeny (G t s) is greater than 5%, preferably, greater than 10%, 20%, 30%, 
40%, and, most preferably, greater than 50%, 60%, 70%, 80%, 90%. In other embodiments, 
the transgene is detected in 10%, 20%, 30%, 40%, and most preferably, greater than 50%, 
60%, 70%, 80%, 90% of chicks hatching from embryos into which nucleic acids have been 
introduced using methods of the invention. 
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5.2 VECTORS 

A variety of vectors useful in carrying out the methods of the present invention are 
described herein. These vectors may be used for stable introduction of a selected 
heterologous polypeptide-coding sequence (and/or regulatory sequences) into the genome of 
5 an avian, in particular, to generate transgenic avians that produce exogenous proteins in 
specific tissues of an avian, and in the oviduct in particular, or in the serum of an avian. In 
still further embodiments, the vectors are used in methods to produce avian eggs containing 
exogenous protein. 

In particular embodiments, preferably for use in the microinjection, sperm-mediated 

10 transgenesis, and nuclear transfer methods described herein, the vectors of the invention are 
not derived from eukaryotic viral vectors or retroviral vectors (except in certain 
embodiments for containing eukaryotic viral regulatory elements such as promoters, origins 
of replication, etc). In particular embodiments, the vector is not an REV, ALV or MuLV 
vector. In particular, useful vectors include, bacteriophages such as lambda derivatives, 

15 such as Xgtl 1, Xgt WES.tB, Charon 4, and plasmid vectors such as pBR322, pBR325, 
P ACYC177, pACYC184, pUC8, pUC9, pUCIS, P UC19, pLG339, P R290, pKC37, 
. pKClOl, SV40, PBLUESCRIPT® E SK+/- or KS +/- (see "Stratagene Cloning Systems" 
Catalog (1993) from STRATAGENE®, La Jolla, Calif., which is hereby incorporated by 
reference), pQE, pIH821, pGEX, pET series (sec Studier, F.W. et. al, 1990, "Use of T7 

20 RNA Polymerase to Direct Expression of Cloned Genes" Gene Expression Technology 185, 
which is hereby incorporated by reference) and any derivatives thereof; cosmid vectors and, 
in preferred embodiments, artificial chromosomes, such as, but not limited to, YACs, 
BACs, BBPACs or P ACs. Such artificial chromosomes are useftd in that a large nucleic 
acid insert can be propagated and introduced into the avian cell. 

25 In other particular embodiments, as detailed above in section 5.2, infra, the vectors 

of the invention are derived from eukaryotic viruses, preferably avian viruses, and can be 
replication competent or, preferably, replication deficient. In particular embodiments, the 
vectors are derived from REV, ALV or MuLV. Nucleic acid sequences or derivative or 
truncated variants thereof; may be introduced into viruses such as vaccinia virus. Methods 

30 for mnlnnQ a viral recombinant vector useful for expressing a protein under the control of 
the lysozyme promoter are analogous to the methods disclosed in U.S. Patent Nos. 
4,603,112; 4,769,330; 5,174,993; 5,505,941; 5,338,683; 5,494,807; 4,722,848; Paoletti, E. f 
1996, Proa Natl. Acad. Sci. 93: 11349-11353; Moss, 1996, Proc. Natl. Acad. Sci. 93: 
11341-11348; Roizman, 1996, Proc. Natl Acad Set 93: 11307-11302; Frolov etal, 1996, 

35 Proc. Natl. Acad. Sci. 93: 1 1371-11377; Grunhaus et al., 1993, Seminars in Virology 3: 
237-252 and U.S. Patent Nos. 5,591,639; 5,589,466; and 5,580,859 relating to DNA 
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expression vectors., inter alia- the contents of which are incorporated herein by reference in 
their entireties. 

Recombinant viruses can also be generated by transfection of plasmids into cells 
infected with virus. 

5 Preferably, vectors can replicate {i.e., have a bacterial origin of replication) and be 

manipulated in bacteria (or yeast) and can then be introduced into avian cells. Preferably, 
the vector comprises a marker that is selectable and/or detectable in bacteria or yeast cells 
and, preferably, also in avian cells, such markers include, but are not limited to, Amp', tef , 
LacZ, etc. Preferably, such vectors can accommodate (i.e., can be used to introduce into 

10 cells and replicate) large pieces of DNA such as genomic sequences, for example, large 
pieces of DNA consisting of at least 25 kb, 50 kb, 75 kb, 100 kb, 150 kb, 200 kb or 250 kb, 
such as BACs, YACs, cosmids, etc. 

The insertion of a DNA fragment into a vector can, for example, be accomplished by 
ligating the DNA fragment into a vector that has complementary cohesive termini. 

1 5 However, if the complementary restriction sites used to fragment the DNA are not present 
in the vector, the ends of the DNA molecules maybe enzymatically modified. 
Alternatively, any site desired may be produced by ligating nucleotide sequences (linkers) 
onto the DNA termini; these ligated linkers may comprise specific chemically synthesized 
oligonucleotides encoding restriction endonuclease recognition sequences. In an alternative 

20 method, the cleaved vector and the transgene may be modified by homopolymeric tailing. 
The vector can be cloned using methddsknown in the art, e.g.,by the methods 
disclosed in Sambrook et aL, 2001; Molecular Cloning, A Laboratory Manual, Third 
Edition, Cold Spring Harbor Laboratory Press, N.Y.; Ausubel et al, 1989, Current 
Protocols in Molecular Biology, Green Publishing Associates and Wiley Interscience, N. Y., 

25 both of which are hereby incorporated by reference in their entireties. Preferably, the 
vectors contain cloning sites, for example, restriction enzyme sites that are unique in the 
sequence of the vector and insertion of a sequence at that site would not disrupt an essential 

vector function, such as replication. 

As discussed above, vectors used in certain methods of the invention preferably can 

30 accommodate, and in certain embodiments comprise, large pieces of heterologous DNA 
such as genomic sequences, particularly avian genomic sequences. Such vectors can 
contain an entire genomic locus, or at least sufficient sequence to confer endogenous 
regulatory expression pattern, eg., high level of expression in the magnum characteristic of 
lysozyme, ovafoumin, ovomucoid, ovotransferrin, etc, and to insulate the expression of the 

35 transgene sequences from the effect of regulatory sequences surrounding the site of 
integration of the transgene in the genome. Accordingly, as detailed below, in preferred 

-43- 



WO 03/025146 PCTAJS02/29878 

embodiments, the transgene is inserted in an entire genomic loci or significant portion 
thereof. 

To manipulate large genomic sequences contained in, for example, a BAC, 
nucleotide sequences coding for the heterologous protein to be expressed and/or other 

5 regulatory elements may be inserted into the BAC by directed homologous recombination in 
bacteria, e.g. 9 the methods of Heintz WO 98/59060; Heintz et al, WO 01/05962; Yang et 
al, 1997 Nature BiotechnoL 15: 859-865; Yang et al, 1999, Nature Genetics 22: 327-35; 
which are incorporated herein by reference in their entireties. 

Alternatively, the BAC can also be engineered or modified by "E-T cloning," as 

10 described by Muyrers et al (1999, Nucleic Acids Res. 27(6): 1555-57, incorporated herein 
by reference in its entirety). Using these methods, specific DNA may be engineered into a 
BAC independently of the presence of suitable restriction sites. This method is based on 
homologous recombination mediated by the recE and recT proteins ("ET-cloning") (Zhang 
et aL 9 1998, Nat. Genet. 20(2): 123-28; incorporated herein by reference in its entirety). 

1 5 Homologous recombination can be performed between a PCR fragment flanked by short 
homology arms and an endogenous intact recipient such as a BAC. Using this method, 
homologous recombination is not limited by the disposition of restriction endonuclease 
cleavage sites or the size of the target DNA. A BAC can be modified in its host strain using 
a plasmid, e.g. , pB AD-aPr, in which recE and recT have been replaced by their respective 

20 functional counterparts of phage lambda (Muyrers et al, 1999, ; Nucleic Acids Res. 27(6): 
1555-57). Preferably, a BAC is modified by recombination with a PCR product containing - 
homology arms ranging from 27-60 bp. In a specific embodiment, homology arms are 50 
bp in length. 

In another embodiment, a transgene is inserted into a yeast artificial chromosome 
25 (YAC) (Burke et al, 1987, Science 236: 806-12; and Peterson et al, 1997, Trends Genet 
13:61, both of which are incorporated by reference herein in their entireties). 

In other embodiments, the transgene is inserted into another vector developed for the 
cloning of large segments of genomic DNA, such as a cosmid or bacteriophage PI 
(Sternberg et al, 1990, Proa Natl. Acad Scl USA 87: 103-07). The approximate 
30 maximum insert size is 30-35 kb for cosmids and 100 kb for bacteriophage PI. In another 
embodiment, the transgene is inserted into a P-l derived artificial chromosome (PAC) 
(Mejia et al, 1997, Genome Res 7:179-186). The maximum insert size is 300 kb. 

Vectors containing the appropriate heterologous sequences may be identified by any 
method well known in the art, for example, by sequencing, restriction mapping, 
35 hybridization, PCR amplification, etc. 
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The vectors of the invention comprise one or more nucleotide sequences encoding a 
heterologous protein desired to be expressed in the transgenic avian, as well as regulatory 
elements such as promoters, enhancers, MARs, IRES ? s and other translation control 
elements, transcriptional termination elements, polyadenylation sequences, etc, as discussed 
5 infra. In particular embodiments, the vector of -the invention contains at least two 

nucleotide sequences coding for heterologous proteins, for example, but not limited to, the 
heavy and light chains of an immunoglobulin. 

In a preferred embodiment, the nucleotide sequence encoding the heterologous 
protein is inserted into all or a significant portion of a nucleic acid containing the genomic 
10 sequence of an endogenous avian gene, preferably an avian gene that is expressed in the 
magnum, e.g., lysozyme, ovalbumin, ovomucoid, conalbumin, ovotransferrin, etc. For 
example, the heterologous gene sequence may be inserted into or replace a portion of the 3' 
untranslated region (UTR) or 5' untranslated region (UTR) or an intron sequence of the 
endogenous gene genomic sequence. Preferably, the heterologous gene coding sequence 
IS has its own IRES. For descriptions of IRESes, see, e.g., Jackson et al, 1990, Trends 
Biochem Set 15(12):477-83; Jang et al., 1988, Virol 62(8):2636-43; Jang et al, 1990, 
£nzyrae 44(l-4):292-309; and Martinez-Salas, 1999, Curr. Opin. Biotechnol 10(5):458-64; 
Pahnenberg et ah, United States Patent No. 4,937,190, which are incorporated by reference 
herein in their entireties. In another embodiment, the heterologous protein coding sequence 
20 is inserted at the 3' end of the endogenous gene coding sequence. In another preferred 
embodiment, the heterologous gene coding sequences are inserted using 5' direct fusion 
wherein the heterologous gene coding sequences are inserted in-frame adjacent to the initial 
ATG sequence (or adjacent the nucleotide sequence encoding the first two, three, four, five, 
six, seven or eight amino acids) of the endogenous gene or replacing some or all of the 
25 sequence of the endogenous gene coding sequence. In yet another specific embodiment, the 
heterologous gene coding sequence is inserted into a separate cistron in the 5' region of the 
endogenous gene genomic sequence and has an independent IRES sequence. 

The present invention further relates to nucleic acid vectors (preferably, not derived 
from eukaryotic viruses, except, in certain embodiments, for eukaryotic viral promoters and/ 
30 or enhancers) and transgenes inserted therein that incorporate multiple polypeptide- 
encoding regions, wherein a first polypeptide-encoding region is operatively linked to a 
transcription promoter and a second polypeptide-encoding region is operatively linked to an 
IRES. For example, the vector may contain coding sequences for two different 
heterologous proteins (e.g., the heavy and light chains of an immunoglobulin) or the coding 
35 sequences for all or a significant part of the genomic sequence for the gene from which the 
promoter driving expression of the transgene is derived, and the heterologous protein 
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desired to be exDressed a construct containing the genomic coding sequences, 
including introns, of the avian lysozyme gene when the avian lysozyme promoter is used to 
drive expression of the transgene, an IRES, and the coding sequence for the heterologous 
protein desired to be expressed downstream (Le., 3' on the RNA transcript of the IRES)). 
5 Thus, in certain embodiments, the nucleic acid encoding the heterologous protein is 

introduced into the 5' untranslated or 3' untranslated regions of an endogenous gene, such as 
but not limited to, lysozyme, ovalbumin, ovotransferrin, and ovomucoid, with an IRES 
sequence directing translation of the heterologous sequence. 

Such nucleic acid constructs, when inserted into the genome of a bird and expressed 
1 o therein, will generate individual polypeptides that may be post-translationally modified, for 
example, glycosylated or, in certain embodiments, form complexes, such as heterodimers 
with each other in the white of the avian egg. Alternatively, the expressed polypeptides may 
be isolated from an avian egg and combined in vitro, or expressed in a non-reproductive 
tissue such as serum. In other embodiments, for example, but not limited to, when 
15 expression of both heavy and light chains of an antibody is desired, two separate constructs, 
each containing a coding sequence for one of the heterologous proteins operably linked to a 
promoter (either the same or different promoters), are introduced by microinjection into 
cytoplasm of one or more embryonic cells and transgenic avians harboring both transgenes. 
in their, genomes and expressing both heterologous proteins are identified. Alternatively, 
20 two transgenic avians each containing one of the two heterologous proteins one 
transgenic avian having a transgene encoding the light chain of an antibody and a second 
transgenic avian having a transgene encoding the heavy chain of the antibody) can be bred 
to obtain an avian containing both transgenes in its germline and expressing both transgene 

encoded proteins, preferably in eggs. 

25 Recombinant expression vectors can be designed for the expression of the encoded 

proteins in eukaryotic cells. Useful vectors may comprise constitutive or inducible 
promoters to direct expression of either fusion or non-fusion proteins. With fusion vectors, 
a number of amino acids are usually added to the expressed target gene sequence such as, 
but not limited to, a protein sequence for thioredoxin, a polyhistidine, or any other amino 

30 acid sequence that fecilitates purification of the expressed protein. A proteolytic cleavage 
site may further be introduced at a site between the target recombinant protein and the 
fusion sequence. Additionally, a region of amino acids such as a polymeric histidine region 
may be introduced to allow binding of the fusion protein to metallic ions such as nickel 
bonded to a solid support, and thereby allow purification of the fusion protein. Once the 

35 fusion protein has been purified, the cleavage site allows the target recombinant protein to 
be separated from the fusion sequence. Enzymes suitable for use in cleaving the proteolytic 
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nleava^e site include, but are not limited to, Factor Xa and thrombin. Fusion expression 
vectors that may be useful in the present invention include pGex (AMRAD® Corp., 
Melbourne, Australia), pRIT5 (PHARMACIA®, Piscataway, NJ) and pMAL (NEW 
ENGLAND BIOLABS®, Beverly, MA), fusing glutathione S-transferase, protein A, or 
5 maltose E binding protein, respectively, to the target recombinant protein. 

Once a promoter and a nucleic acid encoding a heterologous protein of the present 
invention have been cloned into a vector system, it is ready to be incorporated into a host 
cell. Such incorporation can be carried out by the various forms of transformation noted 
above, depending upon the vector/host cell system. It is contemplated that the incorporation 
10 of the DNA of the present invention into a recipient cell may be by any suitable method 
such as, but not limited to, viral transfer, electroporation, gene gun insertion, sperm- 
mediated transfer to an ovum, microinjection and the like. Suitable host cells include, but 
are not limited to, bacteria, virus, yeast, mammalian cells, and the like. In particular, the 
present invention contemplates the use of recipient avian cells, such as chicken cells or 
15 quail cells. 

Another aspect of the present invention, therefore, is a method of expressing a 
heterologous polypeptide in a eukaryotic cell by transferring an avian cell with a 
recombinant DNA comprising an avian tissue-specific promoter operably linked to a nucleic 
acid insert encoding a polypeptide and, optionally, a polyadenyiation signal sequence, and 
20 culturing the transfected cell in a medium suitable for expression of the heterologous 
polypeptide under the control of the avian lysozyme gene expression control region. 

Yet another aspect of the present invention is a eukaryotic cell transformed with an 
expression vector according to the present invention and described above. In one 
embodiment of the present invention, the transformed cell is a chicken oviduct cell and the 
25 nucleic acid insert comprises the chicken lysozyme gene expression control region, a 

nucleic acid insert encoding a human interferon a2b and codon optimized for expression in 
an avian cell, and an SV40 polyadenyiation sequence. 

In another embodiment, the transformed cell is a quail oviduct cell and the nucleic 
acid insert comprises the artificial avian promoter construct MDOT (SEQ ID NO.:ll) 
30 operably linked to an mterferon^coding sequence, as described in Example 34 below. 
In yet another embodiment of the present invention, a quail oviduct cell is 
transfected with the nucleic acid insert comprising the MDOT artificial promoter construct 
operably linked to an erythropoietin (EPO>encoding nucleic acid, wherein the transfected 
quail produces heterologous erythropoietin. 
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5.2.1 PROMOTERS 
The vectors of the invention contain promoters that function in avian cells, 
preferably, that are tissue-specific and, in preferred embodiments, direct expression in the 
magnum or serum or other tissue such that expressed proteins are deposited in eggs, more 
5 preferably, that are specific for expression in the magnum. Alternatively, the promoter 
directs expression of the protein in the serum of the transgenic avian. Introduction of the 
vectors of the invention, preferably, generate transgenics that express the heterologous 
protein in tubular gland cells where it is secreted into the oviduct lumen and deposited, e.g., 
into the white of an egg. In preferred embodiments, the promoter directs a level of 
10 expression of the heterologous protein in the egg white of eggs laid by G 0 and/or G, chicks 
and/or their progeny that is greater than 5 ug, 10 ug, 50 ug, 100 ug, 250 ug, 500 ug, or 750 
ug, more preferably greater than 1 mg, 2 mg, 5 mg, 10 mg, 20 mg, 50 mg, 100 mg, 200 mg, 
500 mg, 700 mg, 1 gram, 2 grams, 3 grams, 4 grams or 5 grams. Such levels of expression 
can be obtained using the promoters of the invention. 
1 5 In preferred embodiments, the promoters of the invention are derived from genes 

that express proteins present in significant levels in the egg white and/or the serum. For 
example, the promoter comprises regions of an ovomucoid, ovalbumin, conalbumin, 
lysozyme or ovotransferrin promoter or any other promoter that directs expression of a gene 
in an avian, particularly in a specific tissue of interest, such as the magnum or in the serum. 
20 Alternatively, the promoter used in the expression vector may be derived from that of the 
lysozyme gene that is expressed in both the oviduct and macrophages. Portions of two or 
more of these, and other promoters that function in avians, may be combined to produce 

effective synthetic promoter. 

The promoter may optionally be a segment of the ovalbumin promoter region mat is 

25 sufficiently large to direct expression of the coding sequence in the tubular gland cells. 
Other exemplary promoters include the promoter regions of the ovalbumin, lysozyme, 
ovomucoid, ovotransferrin or ovomucin genes (for example, but not limited to, as disclosed 
in co-pending United States Patent Application Nos. 09/922,549, filed August 3, 2001 and 
10/114,739, filed April .1, 2002, both entitled "Avian Lysozyme Promoter", by Rapp, and 

30 United States Patent Application No. 09/998,716, filed November 30, 2001, entitled 

"Ovomucoid Promoter and Methods of Use," by Harvey et aL, all of which are incorporated 
by reference herein in their entireties). Alternatively, the promoter may be a promoter that 
is largely, but not entirely, specific to the magnum, such as the lysozyme promoter. Other 
suitable promoters may be artificial constructs such as a combination of nucleic acid regions 

35 derived from at least two avian gene promoters. One such embodiment of the present 
invention is the MDOT construct (SEQ ID NO: 1 1) comprising regions derived from the 
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chicken ovomucin and ovotransferrin promoters, including but not limited to promoters 
altered, e.g., to increase expression, and inducible promoters, e.g., the tef system. 

The ovalbumin gene encodes a 45 kD protein that is also specifically expressed in 
the tubular gland cells of the magnum of the oviduct (Beato, 1989, Cell 56:335-344). 

5 Ovalbumin is the most abundant egg white protein, comprising over 50 percent of the total 
protein produced by the tubular gland cells, or about 4 grams of protein per large Grade A 
egg (Gilbert, lc Egg albumen and its formation" in Physiology and Biochemistry of the 
Domestic Fowl, Bell and Freeman, eds., Academic Press, London, New York, pp. 1291- 
1329). The ovalbumin gene and over 20 kb of each flanking region have been cloned and 

10 analyzed (Lai et al, 1978, Proc. Natl Acad. Set. USA 75:2205-2209; Gannon etai, 1979, 
Nature 278:428-424; Roop et a/.,1980, Cell 19:63-68; and Royal et a/.,1975, Nature 
279:125-132). 

The ovalbumin gene responds to steroid hormones such as estrogen, glucocorticoids, 
and progesterone, which induce the accumulation of about 70,000 ovalbumin mRNA 

15 transcripts per tubular gland cell in immature chicks and 100,000 ovalbumin mRNA 
transcripts per tubular gland cell in the mature laying hen (Palmiter, 1973, J. Biol. Chem. 
248:8260-8270; Palmiter, 1975, Cell 4:1 89-197). The 5' flanking region contains four 
DNAse I-hypersensitive sites centered at -0.25, -0.8, -3.2, and -6.0 kb from the transcription 
start site. These sites are called HS-T, II, -HI, and -IV, respectively. Promoters of the 

20 invention may contain one, all, or a combination of HS-I, HS-II, HS-IH and HS0IV. 

Hypersensitivity of HS-II and -HI are estrogen-induced, supporting a role for these regions 
in hormone-induction of ovalbumin gene expression. 

HS-I and HS-II are both required for steroid induction of ovalbumin gene 
transcription, and a 1.4 kb portion of the 5" region that includes these elements is sufficient 



PI 
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and McKnight, 1988, Biochemistry 27: 6550-6557). HS-I is termed the negative-response 
element ("NRE") because it contains several negative regulatory elements which repress 
ovalbumin expression in the absence of hormone (Haekers et al., 1995, MoL Endo. 9:1 1 13- 
1 126). Protein factors bind these elements, including some factors only found in oviduct 

30 nuclei suggesting a role in tissue-specific expression. HS-H is termed the steroid-dependent 
response element ("SDRE") because it is required to promote steroid induction of 
transcription. It binds a protein or protein complex known as Chirp-I. Chirp-I is induced by 
estrogen and turns over rapidly in the presence of cyclohexamide (Dean et al., 1996, MoL 
Cell. Biol. 16:2015-2024). Experiments using an explanted tubular gland cell culture 

35 system defined an additional set of factors that bind SDRE in a steroid-dependent manner, 
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including a NFicB-like factor (Nordstrom et al, 1993, 1 Biol. Chem. 268:13193-13202; 
Schweers and Sanders, 1991,/. Biol Chem, 266: 10490-10497). 

Less is known about the function ofHS-HI and HS-IV. HS-m contains a functional 
estrogen response element, and confers estrogen inducibility to either the ovalbumin 
5 proximal promoter or a heterologous promoter when co-transfected into HeLa cells with an 
estrogen receptor cDNA. These data imply that HS-HI may play a functional role in the 
overall regulation of the ovalbumin gene. Little is known about the function of HS-IV, 
except that it does not contain a functional estrogen-response element (Kato et al., 1992, 
Cell 68: 731-742). 

10 hi an alternative embodiment of the invention, transgenes containing constitutive 

promoters are used, but the transgenes are engineered so that expression of the transgene 
effectively becomes magnum-specific. Thus, a method for producing an exogenous protein 
in an avian oviduct provided by the present invention involves generating a transgenic avian 
having two transgenes in its tubular gland cells. One transgene comprises a first coding 

15 sequence operably linked to a constitutive promoter. The second transgene comprises a 
second coding sequence that is operably linked to a magnum-specific promoter, where 
expression of the first coding sequence is either directly or indirectly dependent upon the 
cellular presence of the protein expressed by the second coding sequence. 

Additional promoters useful in the present invention include inducible promoters, 

20 such as the tet operator and the metallothionein promoter which can be induced by 
treatment with tetracycline and zinc ions, respectively (Gossen et al, 1992, Proc. Natl 
Acad. Sci. 89: 5547-5551 and Walden et al, 1987, Gene 61: 317-327; incorporated herein 
by reference in their entireties). 

25 Chicken lvsozvme gene expression control region nucleic acid s equences: 

The chicken lysozyme gene is highly expressed in the myeloid lineage of 
hematopoietic cells, and in the tubular glands of the mature hen oviduct (Hauser et al, 
\9%\,Hematol and Blood Transfusion 26: 175-178; Schutz et al, 1978, Cold Spring 
Harbor Symp. Quart Biol. 42: 617-624) and is therefore a suitable candidate for an efficient 

30 promoter for heterologous protein production in trarisgenic animals. The regulatory region 
of the lysozyme locus extends over at least 12 kb of DNA 5' upstream of the transcription 
start site, and comprises a number of elements that have been individually isolated and 
characterized. The known elements include three enhancer sequences at about -6.1 kb, -3.9 
kb, and -2.7 kb (Grewal et al, 1992, Mol Cell Biol 12: 2339-2350; Bonifer et al, 1996, J. 

35 Mol Med. 74: 663-671), a hormone responsive element (Hecht et al, 1988, E.M.B.O.J. 7: 
2063-2073), a silencer element and a complex proximal promoter. The constituent 
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elements of the lysozyme gene expression control region are identifiable as DNAase t 
hypersensitive chromatin sites (DBS). They may be differentially exposed to nuclease 
digestion depending upon the differentiation stage of the cell. For example, in the 
multipotent progenitor stage of myelomoncytic cell development, or in erythroblasts, the 
5 silencer element is a DHS . At the myeloblast stage, a transcription enchancer located -6. 1 
kb upstream from the gene transcription start site is a DHS, while at the later monocytic 
stage another enhancer, at -2.7 kb becomes DNAase sensitive (Huber et a/., 1995, DNA and 

Cell Biol 14:397-402). 

This invention also envisions the use of promoters other than the lysozyme 

10 promoter, including but not limited to, a cytomegalovirus promoter, an ovomucoid, 

conalburnin or ovotransferrin promoter or any other promoter that directs expression of a 
gene in an avian, particularly in a specific tissue of interest, such as the magnum. 

Another aspect of the methods of the present invention is the use of combinational 
promoters comprising an artificial nucleic acid construct having at least two regions 

1 5 wherein the regions are derived from at least two gene promoters, including but not limited 
to a lysozyme, ovomucoid, conalburnin or ovotransferrin promoter, hi one embodiment of 
the present invention, the promoter may comprise a region of an avian ovomucoid promoter 
and a region of an avian oxotransferrin promoter, thereby generating the MDOT avian 
artificial promoter construct as described in Example 12, below. The avian MDOT 

20 promoter construct of the present invention has the nucleic acid sequence SEQ ID NO: 1 1 
and is illustrated in FIG. 14. This promoter is useful for allowing expression of a 
heterologous protein in chicken oviduct cells and may be operably linked to any nucleic 
acid encoding a heterologous polypeptide of interest including, for example, a cytokine, 
growth hormone, growth factor, enzyme, structural protein or the like. 

25 

5.2.2 MATRIX ATTACHMENT REGIONS 
In preferred embodiments of the invention, the vectors contain matrix attachment 
regions (MARs) mat preferably flank the transgene sequences to reduce position effects on 
expression when integrated into the avian genome, m fact, 5 r MARs and 3' MARs (also 

30 referred to as "scaffold attachment regions" or SARs) have been identified in the outer 
boundaries of the chicken lysozyme locus (Phi-Van et al 9 1988, EM.B.OJ. 7: 655-664; 
Phi-Van, L. and Sterling, W.H., 1996, Biochem. 35: 10735-10742). Deletion of a 1.32 kb 
or a 1.45 kb halves region, each comprising half of a 5' MAR, reduces positional variation 
in the level of transgene expression (Phi-Van and Stratling, supra). 

35 The 5 ' matrix-associated region (5 ' MAR), located about -1 1 .7 kb upstream of the 

chicken lysozyme transcription start site, can increase the level of gene expression by 
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limiting the positional effects exerted against a transgene (Phi- Van et aL, 1988, supra). At 
least one other MAR is located 3 ' downstream of the protein encoding region. Although • 
MAR nucleic acid sequences are conserved, little cross-hybridization is seen, indicating 
significant overall sequence variation. However, MARs of different species can interact 
5 with the nucleomatrices of heterologous species, to the extent that the chicken lysozyme 
MAR can associate with the plant tobacco nucleomatrix as well as that of the chicken 
oviduct cells (Mlynarona et aL, 1994, Cell 6: 417-426; von Kries et aL, 1990, Nucleic Acids 

Res. 18:3881-3885). 

Gene expression must be considered not only from the perspective of cis-regulatory 

10 elements associated with a gene, and their interactions with trans-acting elements, but also 
with regard to the genetic environment in which they are located. Chromosomal positioning 
effects (CPEs), therefore, axe the variations in levels of transgene expression associated with 
different locations of the transgene within the recipient genome. An important factor 
governing CPE upon the level of transgene expression is the chromatin structure around a 

1 5 transgene, and how it cooperates with the cis-regulatory elements. The cis-elements of the 
lysozyme locus are confined within a single chromatin domain (Bonifer et aL, 1996, supra; 
Sippei et aL, pgs. 133-147 in Eckstein F. & Lilley D.MJ. (eds), "Nucleic Acids and 
Molecular Biology", Vol. 3, 1989, Springer. 

The lysozyme promoter region of chicken is active when trassfected into mouse 

20 fibroblast cells and linked to a reporter gene such as the bacterial chloramphenicol 
acetyltransfexase (CAT) gene. The promoter element is also effective when transiently 
transfected into chicken promacrophage cells. In each case, however, the presence of a 5 ' 
MAR element increased positional independency of the level of transcription (Stief et aL, 
1989, Nature 341: 343-345; Sippei et aL, pgs. 257 - 265 in Houdebine L.M. (ed), 

25 "transgenic Animals: Generation and Use"). 

The ability to direct the insertion of a transgene into a site in the genome of an 
animal where the positional effect is limited offers predictability of results during the 
development of a desired transgenic animal, and increased yields of the expressed product 
Sippei and Steif disclose, in U.S. Patent No. 5,731,178, which is incorporated by reference 

30 herein in its entirety, methods to increase the expression of genes introduced into eukaryotic 
cells by flanking a transcription unit with scaffold attachment elements, in particular the 5' 
MAR isolated from the chicken lysozyme gene. The transcription unit disclosed by Sippei 
and Steif was an artificial construct that combined only the -6.1 kb enhancer element and 
the proximal promoter element (base position -579 to +15) from the lysozyme gene. Other 
35 promoter associated elements were not included. However, although individual cis- 
regulatory elements have been isolated and sequenced, together with short regions flanking 
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DNA, the entire nucleic acid sequence comprising the functional 5' upstream region of the 
lysozyme gene has not been determined in its entirety and therefore not employed as a 
functional promoter to allow expression of a heterologous transgene. 

Accordingly, vectors of the invention comprise MARs, preferably both 5' and 3* 
5 MARs that flank the transgene, including the heterologous protein coding sequences and the 
regulatory sequences. 

5.2.3 NUCLEAR LOCALIZATION SIGNAL PEPTIDES 

Targeting of the nucleic acids introduced into embryonic cells using methods of the 
10 invention may be enhanced by mixing the nucleic acid to be introduced with a nuclear 
localization signal (NLS) peptide prior to introduction, e.g., microinjection, of the nucleic 
acid. Nuclear localization signal (NLS) sequences are a class of short amino acid sequences 
which maybe exploited for cellular import of linked cargo into a nucleus. The present 
invention envisions the use of any NLS peptide, including but not limited to, the NLS 

1 5 peptide of SV40 virus T-antigen. 

An NLS sequence of the invention is an amino acid sequence which mediates 
nuclear transport into the nucleus, wherein deletionn of the NLS prevents nuclear transport. 
In particular embodiments, a NLS is a highyl cationic peptide. The present invention 
envisions the use of any NLS sequence, including but not limited to, S V40 . virus T-antigen. 

20 NLSs known in the art include, but are not limited to those discussed in Cokol et al. 7 2000, 
EMBO Reports, 1(5):411-415, Boulikas, T., 1993, CriU Rev. Eukaryot. GeneExpr., 3:193- 
227, Collas, P. et al., 1996, Transgenic Research, 5: 451^58, Collas and Alestrom, 1997, 
Biochem. Cell Biol 75: 633-640, Collas and Alestrom, 1998, Transgenic ResarcK 7: 303- 
309, Collas and Alestrom, Mol Reprod DeveL, 1996, 45:431-438, all of which are 

25 incorporated by reference in their entireties. 

5.2.4 CODON-OPTIMIZED GENE EXPRESSION 

Another aspect of the present invention provides nucleic acid sequences encoding 
heterologous polypeptides that are wdon-c^timized for expression in avian cells, and 

30 derivatives and fragments thereof. When a heterologous nucleic acid is to be delivered to a 
recipient cell for expression therein, the sequence of the nucleic acid sequence may be 
modified so that the codons are optimized for the codon usage of the recipient species. For 
example, if the heterologous nucleic acid is transfected into a recipient chicken cell, the 
sequence of the expressed nucleic acid insert is optimized for chicken codon usage. This 

35 may be determined from the codon usage of at least one, and preferably more than one, 
protein expressed in a chicken cell. For example, the codon usage may be determined from 
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the nucleic acid sequences encoding the proteins ovalbumin, lysozyme, ovomucin and 
ovotransferrin of chicken. Briefly, the DNA sequence for the target protein may be 
optimized using the BACKTRANSLATE® program of the Wisconsin Package, version 9.1 
(Genetics Computer Group, Inc., Madison, WI) with a codon usage table compiled from the 

5 chicken (Gatlus gallus) ovalbumin, lysozyme, ovomucoid, and ovotransferrin proteins. The 
template and primer oligonucleotides are then amplified, by any means known in the art, 
including but not limited to PCR with Pfu polymerase (STRATAGENE®, La Jolla CA). 

In one exemplary embodiment of a heterologous nucleic acid for use by the methods 
of the present invention, a nucleic acid insert encoding the human interferon ct2b 

10 polypeptide optimized for codon-usage by the chicken is microinjected into the cytoplasm 
of a stage 1 embryo. Optimization of the sequence for codon usage is useful in elevating the 

level of translation in avian eggs. 

It is contemplated to be within the scope of the present invention for any nucleic 
acid encoding a polypeptide to be optimized for expression in avian cells. It is further 
1 5 contemplated that the codon usage may be optimized for a particular avian species used as a 
source of the host cells. In one embodiment of the present invention, the heterologous 
polypeptide is encoded using the codon-usage of a chicken. 

5.2.5 SPECIFIC VECTORS OF THE INVENTION 

20 hi a preferred embodiment, a transgene of the invention comprises a chicken, or 

other avian, lysozyme control region sequence which directs expression of the coding 
sequence within the transgene. A series of PCR amplifications of template chicken 
genomic DNA are used to isolate the gene expression control region of the chicken 
lysozyme locus. Two amplification reactions used the PCR primer sets 5pLMAR2 (5'- 

25 TGCCGCCTTCTTTGATATTC-3 1 ) (SEQ ID NO: 1) and LE-6.1kbrevl (5'- 
TTGGTGGTAAGGCCITTTTG-3') (SEQ ID NO: 2) (Set 1) and lys-6. 1 (5'- 
CTGGCAAGCTGTCAAAAACA-3 1 ) (SEQ ID NO: 3) and LysElRev (5'- 
CAGCTCACATCGTCCAAAGA-3') (SEQ ID NO: 4) (Set 2). The amplified PCR 
products were united as a contiguous isolated nucleic acid by a third PCR amplification step 

30 with the primers SEQ ID NOS: 1 and 4, as described in Example 9 below. 

The isolated PCR-amplified product, comprising about 12 kb of the nucleic acid 
region 5 ' upstream of the native chicken lysozyme gene locus, was cloned into the plasmid 
pOVlV-LysSPIFNMM. pCMV-LysSPIFNMM comprises a modified nucleic acid insert 
encoding a human interferon o2b sequence and an SV40 polyadenylation signal sequence 

35 (SEQ ID NO: 8) 3 ' downstream of the interferon encoding nucleic acid. The sequence SEQ 
ID NO: 5 of the nucleic acid insert encoding human interferon o2b was in accordance with 
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avian cell codon usage, as determined from the nucleotide sequences encoding chicken 
ovomucin, ovalbumin, ovotransferrin and lysozyme. 

The nucleic acid sequence (SEQ ID NO: 6) (GenBank Accession No. AF405538) of 
the insert in pAVUCR-Al 15.93.1.2 is shown in FIG. 1A-E. The modified human interferon 

> a2b encoding nucleotide sequence SEQ ID NO: 5 (GenBank Accession No. AF405539) and 
the novel chicken lysozyme gene expression control region SEQ ID NO: 7 (GenBank 
Accession No. AF405540), shown in FIGS. 2 and 3A-E respectively. A polyadenyiation 
signal sequence that is suitable for operably Unking to the polypeptide-encoding nucleic 
acid insert is the SV40 signal sequence SEQ ID NO: 8, as shown in FIG. 4. 

!0 The piasmid pAVUCR-Al 15.93.1.2 was restriction digested with enzyme Fsel to 

isolate a 1 5.4 kb DNA containing the lysozyme 5' matrix attachment region (MAR) and the 
-12.0 kb lysozyme promoter during the expression of the interferon-encoding insert, as 
described in Example 10, below. Piasmid plffilys was restriction digested with Mlul and 
Xhol to isolate an approximately 6 kb nucleic acids, comprising the 3' lysozyme domain, the 

15 sequence of which (SEQ ID NO: 9) is shown in FIG. 5A-C. The 15.4 kb and 6 kb nucleic 
acids were ligated and the 21.4 kb nucleic acid comprising the nucleic acid sequence SEQ 
ID NO: 10 as shown in FIG. 6A-J was transformed into recipient STBL4 cells as described 

in Example 10, below. 

The inclusion of the novel isolated avian lysozyme gene expression control region of 

20 the present invention upstream of a codon-optimized interferon-encoding sequence in 
pAVUCR-Al 15.93.1.2 allowed expression of the interferon polypeptide in avian cells 
transfected by cytoplasmic microinjection, as described in Examples 3 and 4; below. The 3' 
lysozyme domain SEQ ID NO: 9, when operably linked downstream of a heterologous 
nucleic acid insert, also allows expression of the nucleic acid insert as described in Example 

25 11, below. For example, the nucleic acid insert may encode a heterologous polypeptide 
such as the a2b interferon encoded by the sequence SEQ ID NO: 5. 

It is further contemplated that any nucleic acid sequence encoding a polypeptide may 
be operably linked to the novel isolated avian lysozyme gene expression control region 
(SEQ ID NO: 7) and optionally operably linked to the 3 f lysozyme domain SEQ ID NO. 9 so 

30 as to be expressed in a transfected avian cell. The piasmid construct pAVUCR-Al 15.93. 1.2 
when transfected into cultured quail oviduct cells, which were then incubated for about 72 
hours. ELBA assays of the cultured media showed that the transfected cells synthesized a 
polypeptide detectable with anti-human interferon a2b antibodies. Piasmid construct 
pAVDCR-A212.89.2.1 andpAVUCR-A212.89.2.3 transfected into chicken 

35 myelomonocytic HD1 1 cells yield detectable human a2b interferon, as described in 
Example 3 and 4 below, and shown in FIGS. 8-12. 
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The isolated chicken lysozyme gene expression control region (SEQ ID NO: 7) for 
use in the methods of the present invention comprises the nucleotide elements that are 
positioned 5' upstream of the lysozyme-encoding region of the native chicken lysozyme 
locus and which are necessary for the regulated expression of a downstream polypeptide- 

5 encoding nucleic acid. While not wishing to be bound by any one theory, the inclusion of at 
least one 5' MAR sequence of or reference element in the isolated control region may 
confer positional independence to a transfected gene operably linked to the novel lysozyme 
gene expression control region. 

The isolated lysozyme gene expression control region (SEQ ID NO: 7) of the present 

10 invention is useful for reducing the chromosomal positional effect of a transgene operably 
linked to the lysozyme gene expression control region and transfected into a recipient avian 
cell. By isolating a region of the avian genome extending from a point 5' upstream of a 5 r 
MAR of the lysozyme locus to the junction between the signal peptide sequence and a 
polypeptide-encoding region, cis-regulatory elements are also included that may allow gene 

1 5 expression in a tissue-specific manner. The lysozyme promoter region of the present 
invention, therefore, will allow expression of an operably linked heterologous nucleic acid 
insert in a transfected avian cell such as, for example, an oviduct cell. 

It is further contemplated that a recombinant DNA of the present invention may 
further comprise the chicken lysozyme 3' domain (SEQ. ID NO: 9) linked downstream of ; 

20 the nucleic acid insert encoding a heterologous polypeptide. The lysozyme 3' domain (SEQ 
ID NO: 9) includes a nucleic acid sequence encoding a 3 f MAR domain that may cooperate 
with a 5' MAR to direct the insertion of the construct of the present invention into the 
chromosome of a transgenic avian, or may act independently of the 5' MAR 

Fragments of a nucleic acid encoding a portion of the subject lysozyme gene 

25 expression control region may also be useful as an autonomous gene regulatory element that 
may itself be operably linked to a polypeptide-encoding nucleic acid. Alternatively, the 
fragment may be combined with fragments derived from other gene promoters, such as an 
avian ovalbumin or ovomucoid promoter, thereby generating novel promoters having new 
properties or a combination of properties. As used herein, a fragment of the nucleic acid 

30 encoding an active portion of a lysozyme gene expression control region refers to a 
nucleotide sequence having fewer nucleotides than the nucleotide sequence encoding the 
entire nucleic acid sequence of the lysozyme gene expression control region, but at least 200 
nucleotides. 

The present invention also contemplates the use of antisense nucleic acid molecules 
35 that are designed to be complementary to a coding strand of a nucleic acid 

complementary to an endogenous DNA or an mRNA sequence) or, alternatively, 

-56- 



WO 03/025146 PCT/US02/29878 

complimentary to a 5' or 3' untranslated region of the mRNA and therefore useful for 

regulating the expression of a gene by the lysozyme promoter. 

Synthesized oligonucleotides can be produced in variable lengths when for example, 

non-naturally occurring polypeptide sequences are desired. The number of bases 
5 synthesized will depend upon a variety of factors, including the desired use for the probes or 

primers. Additionally, sense or anti-sense nucleic acids or oligonucleotides can be 

chemically synthesized using modified nucleotides to increase the biological stability of the 

molecule or of the binding complex formed between the anti-sense and sense nucleic acids. 

For example, acridine substituted nucleotides can be synthesized. Protocols for designing 
10 isolated nucleotides, nucleotide probes, and/or nucleotide primers are well-known to those 

of ordinary skill, and can be purchased commercially from a variety of sources [e.g., 
SIGMA GENOSYS®, The Woodlands, TX or The Great American Gene Co., Ramona, 

CA). 

15 5.2.6 RECOMBINANT EXPRESSION VECTORS 

A useful application of the novel promoters of the present invention, such as the 
avian lysozyme gene expression control region (SEQ ID NO: 7) or the MDOT promoter 
construct (SEQ ID NO: 1 1 , Example 12, below) is the possibility of increasing the amount 
of a heterologous protein present in.a bud, especially a chicken, by gene transfer. In most 
20 instances, a heterologous polypeptide-encoding nucleic acid insert transferred into the 
recipient animal host will be operably linked with a gene expression control region to allow 
the cell to initiate and continue production of the genetic product protein. A recombinant 
DNA molecule of the present invention can be transferred into the extra-chromosomal or 

genomic DNA of the host. 
25 Expression of a foreign gene in an avian cell permits partial or complete post- 

translational modification such as, but not only, glycosylation, as shown, for example, in 
FIGS. 10-12, and/or the formation of the relevant inter- or intra-chain disulfide bonds. 
Examples of vectors useful for expression in the chicken Gallus gallus include pYepSecl 
(Baldari et al, 1987, E.M.B.O.J., 6: 229-234; incorporated herein by reference in its 

30 entirety) and pYES2 (INVTTORGEN® Corp., San Diego, CA). 

The present invention contemplates that the injected cell may transiently contain the 
injected DNA whereby the recombinant DNA or expression vector may not be integrated 
into the genomic nucleic acid. It is further contemplated that the injected recombinant DNA 
or expression vector may be stably integrated into the genomic DNA of the recipient cell, 

35 thereby repeating with the cell so that each daughter cell receives a copy of the injected 
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nucleic acid. It is still further contemplated for the scope of the present invention to include 
a transgenic animal producing a heterologous protein expressed from an injected nucleic 
acid according to the present invention. 

Heterologous nucleic acid molecules can be delivered to cells using the cytoplasmic 

5 microinjection method or any other method of the present invention. The nucleic acid 

molecule may be inserted into a cell to which the nucleic acid molecule (or promoter coding 
region) is heterologous (i.e., not normally present). Alternatively, the recombinant DNA 
molecule may be introduced into cells which normally contain the recombinant DNA 
molecule or the particular coding region, as, for example, to correct a deficiency in the 

1 0 expression of a polypeptide, or where over-expression of the polypeptide is desired. 

Another aspect of the present invention, therefore, is a method of expressing a 
heterologous polypeptide in an avian cell by transfecting the avian cell with a selected 
heterologous nucleic acid comprising an avian promoter operably linked to a nucleic acid 
insert encoding a polypeptide and, optionally, a polyadenylation signal sequence. The 

15 transfected cell, which may be an avian embryonic cell microinjected with a heterologous 
nucleic acid, will generate a transgenic embryo that after introduction into a recipient hen 
will be laid as a hard-shell egg and develop into a transgenic chick. 

In another embodiment of the present invention, the nucleic acid insert comprises 
the chicken lysozyme gene expression control region, a nucleic acid insert encoding a 

20 human interferon a2b and codon optimized for expression in an avian cell, and a chicken 3' 
domain, z.e., downstream enhancer elements. 

hi one embodiment of the present invention, the transgenic animal is an avian 
selected from a turkey, duck, goose, quail, pheasant, ratite, and ornamental bird or a feral 
bird, hi another embodiment, the avian is a chicken and the heterologous polypeptide 

25 produced under the transcriptional control of the avian promoter is produced in the white of 
an egg. In yet another embodiment of the present invention, the heterologous polypeptide is 
produced in the serum of a bird. 

53 HETEROLOGOUS PROTEINS PRODUCED BY TRANSGENIC 
AVIANS 

30 

* 

Methods of the present invention, providing for the production of heterologous 
protein in the avian oviduct (or other tissue leading to deposition of the protein into the egg) 
and the production of eggs containing heterologous protein, involve providing a suitable 
vector coding for the heterologous protein and introducing the vector into embryonic cells 
35 such as a single cell embryo such that the vector is integrated into the avian genome. A 
subsequent step involves deriving a mature transgenic avian from the transgenic embryonic 
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cells produced in the previous steps by transferring the injected cell or cells into the 
mfundibulum of a recipient hen; producing a hard shell egg from that hen; and allowing the 
egg to develop and hatch to produce a transgenic bird. 

A transgenic avian so produced from transgenic embryonic cells is known as a 

5 founder. Such founders may be mosaic for the transgene (in certain embodiments, the 

founder has 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 90%, 100% of the cells containing 
the transgene. The invention further provides production of heterologous proteins in other 
tissues of the transgenic avians. Some founders will carry the transgene in the tubular gland 
cells in the magnum of their oviducts. These birds will express the exogenous protein 

10 encoded by the transgene in their oviducts. If the exogenous protein contains the 

appropriate signal sequences, it will be secreted into the lumen of the oviduct and into the 
white of an egg. 

Some founders are germ-line founders. A germ-line founder is a founder that carries 
the transgene in genetic material of its germ-line tissue, and may also carry the transgene in 

1 5 oviduct magnum tabular gland cells that express the exogenous protein. Therefore, in 
accordance with the invention, the transgenic bird may have tabular gland cells expressing 
the exogenous protein and the offspring of the transgenic bird will also have oviduct 
magnum tubular gland cells that express the exogenous protein. Alternatively, the offspring 
express a phenotype determined by expression of the .exogenous gene in a specific tissue of 

20 the avian. In preferred embodiments, the heterologous proteins are produced from 

transgenic avians that were not (or the founder ancestors were not) using a eukaryotic viral 

vector, or a retroviral vector. 

The present invention can be used to express, in large yields and at low cost, a wide 
range of desired proteins including those used as human and animal pharmaceuticals, 

25 diagnostics, and livestock feed additives. Proteins such as growth hormones, cytokines, 
structural proteins and enzymes, including human growth hormone, interferon, lysozyme, 
and p-casein, are examples of proteins that are desirably expressed in the oviduct and 
deposited in eggs according to the invention. Other possible proteins to be produced 
include, but are not limited to, albumin, a-1 antitrypsin, antithrombin m, collagen, factors 

30 Vm, DC, X (and the like), fibrinogen, hyaluronic acid, insulin, lactoferrin, protein C, 



erythropoietin 



macrophage colony-stimulating factor (GM-CSF), tissue-type plasminogen activator (tPA), 
feed additive enzymes, somatotropin, and chymotrypsin Immunoglobulins and genetically 
engineered antibodies, including immunotoxins that bind to surface antigens on human 
35 tumor cells and destroy them, can also be expressed for use as pharmaceuticals or 

diagnostics. It is contemplated that immunoglobulin polypeptides expressed in avian cells 
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following transfection by the methods of the present invention may include monomelic 
heavy and light chains, single-chain antibodies or multimeric immunoglobulins comprising 
variable heavy and light chain regions, i.e., antigen-binding domains, or intact heavy and 
light immunoglobulin chains. 

5 

5.3.1 PROTEIN RECOVERY 

The protein of the present invention may be produced in purified form by any known 
conventional technique. For example, chicken cells may be homogenized and centrifuged. 
The supernatant can then be subjected to sequential ammonium sulfate precipitation and 

10 heat treatment. The fraction containing the protein of the present invention is subjected to 
gel filtration in an appropriately sized dextran or polyacrylamide column to separate the 
proteins. If necessary, the protein fraction may be further purified by HPLC. In another 
embodiment, an affinity column is used, wherein the protein is expressed with a tag. 

Accordingly, the invention provides proteins that are produced by transgenic avians 

15 of the invention. In a preferred embodiment, the protein is produced and isolated from an 
avian egg. In another embodiment, the protein is produced and isolated from avian serum. 

5.3.2 MULTIMERIC PROTEINS 

The invention, in preferred embodiments, provides methods for producing 
20 multimeric proteins, preferably immunoglobulins, such as antibodies, and antigen binding 
fragments thereof. 

In one embodiment of the present invention, the multimeric protein is an : 
immunoglobulin, wherein the first and second heterologous polypeptides are an 
immunoglobulin heavy and light chains respectively. Illustrative examples of this and other 

25 aspects and embodiments of the present invention for the production of heterologous 
multimeric polypeptides in avian cells are fully disclosed in U.S. Patent Application No. 
09/877,374, filed June 8, 2001, by Rapp, which is incorporated herein by reference in its 
entirety. In one embodiment of the present invention, therefore, the multimeric protein is an 
immunoglobulin wherein the first and second heterologous polypeptides are an 

30 immunoglobulin heavy and light chain respectively. Accordingly, the invention provides 
immunoglobulin and other multimeric proteins that have been produced by transgenic 
avians of the invention. 

In the various embodiments of this aspect of the present invention, an 
immunoglobulin polypeptide encoded by the transcriptional unit of at least one expression 

35 vector may be an immunoglobulin heavy chain polypeptide comprising a variable region or 
a variant thereof^ and may further comprise a D region, a J region, a C region, or a 
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combination thereof. An immunoglobulin polypeptide encoded by the transcriptional unit 
of an expression vector may also be an immunoglobulin light chain polypeptide comprising 
a variable region or a variant thereof, and may further comprise a J region and a C region: It 
is also contemplated to be within the scope of the present invention for the immunoglobulin 
5 regions to be derived from the same animal species, or a mixture of species including, but 
not only, human, mouse, rat, rabbit and chicken. In preferred embodiments, the antibodies 
are human or humanized 

In other embodiments of the present invention, the immunoglobulin polypeptide 
encoded by the transcriptional unit of at least one expression vector comprises an 
10 immunoglobulin heavy chain variable region, an immunoglobulin light chain variable 
region, and a linker peptide thereby forming a single-chain antibody capable of selectively 
binding an antigen. 

Another aspect of the present invention provides a method for the production in an 
avian of an heterologous protein capable of forming an antibody suitable for selectively 

15 binding an antigen comprising the step of producing a transgenic avian incorporating at 
least one transgene, wherein the transgene encodes at least one heterologous polypeptide 
selected from an immunoglobulin heavy chain variable region, an immunoglobulin heavy 
chain comprising a variable region and a constant region, an immunoglobulin light chain 
variable region, an immunoglobulin light chain comprising a variable region and a constant 

20 region, and a single-chain antibody comprising two peptide-linked immunoglobulin variable 
regions. Preferably, the antibody is expressed such that it is deposited in the white of the 
developing eggs of the avian. The hard shell avian eggs thus produced can be harvested and 
the heterologous polypeptide capable of forming or which formed an antibody can be 
isolated from the harvested egg. It is also understood that the heterologous polypeptides 

25 may also be expressed under the transcriptional control of promoters that allow for release 
of the polypeptides into the serum of the transgenic animal. Exemplary promoters for non- 
tissue specific production of a heterologous protein are the CMV promoter and the RSV 
promoter. 

In one embodiment of this method of the present invention, the transgene comprises 
30 a transcription unit encoding a first and a second immunoglobulin polypeptide operatively 
linked to a transcription promoter, a transcription terminator and, optionally, an internal 
ribosome entry site QRES)(see 9 for example, U.S. Patent No. 4,937,190 to Palmenberg et 
aL, the contents of which is incorporated herein by reference in its entirety). 

hi an embodiment of this method of the present invention, the isolated heterologous 
35 protein is an antibody capable of selectively binding to an antigen. In this embodiment, the 
antibody may be generated within the serum of an avian or within the white of the avian egg 
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by combining at least one immunoglobulin heavy chain variable region and at least one 
immunoglobulin light chain variable region, preferably cross-linked by at least one di- 
sulfide bridge. The combination of the two variable regions will generate a binding site 
capable of binding an antigen using methods for antibody reconstitution that are well known 
5 in the art. 

It is, however, contemplated to be within the scope of the present invention for 
immunoglobulin heavy and Ught.chains, or variants or derivatives thereof, to be expressed 
in separate transgenic avians, and therefore isolated from separate media including serum or 
eggs, each isolate comprising a single species of immunoglobulin polypeptide. The method 
10 may further comprise the step of combining a plurality of isolated heterologous 

immunoglobulin polypeptides, thereby producing an antibody capable of selectively binding 
to an antigen. In this embodiment, two individual transgenic avians may be generated 
wherein one transgenic produces serum or eggs having an immunoglobulin heavy chain 
variable region, or a polypeptide comprising such, expressed therein. A second transgenic 
15 animal, having a second transgene, produces serum or eggs having an immunoglobulin light 
chain variable region, or a polypeptide comprising such, expressed therein. The 
polypeptides may be isolated from their respective sera and eggs and combined in vitro to 
generate a binding site capable of binding an antigen. 

Examples of therapeutic antibodies that can be used in methods of the invention 
20 include but are not limited to HERCEPTIN® (Trastuzumab) (Genentech, CA) which is a 
humanized anti-HER2 monoclonal antibody for the treatment of patients with metastatic 
breast cancer, REOPRO® (abciximab) (Centocor) which is an anti-giycoprotein Hb/HIa 
receptor on the platelets for the prevention of clot formation; ZENAPAX® (daclizumab) 
(Roche Pharmaceuticals, Switzerland) which is an immunosuppressive, humanized anti- 
25 CD25 monoclonal antibody for the prevention of acute renal allograft rejection; 

PANOREX™ which is a murine anti-17-IA cell surface antigen IgG2a antibody (Glaxo 
Wellcome/Centocor); BEC2 which is a murine anti-idiotype (GD3 epitope) IgG antibody 
(ImClone System); IMC-C225 which is a chimeric anti-EGFR IgG antibody (LmClone 
System); VTTAXIN™ which is a humanized anti-aVp3 integrin antibody (Applied 
30 Molecular Evolution/Medlmmune); Campath 1H/LDP-03 which is a humanized anti CD52 
IgGl antibody (Leukosite); Smart M195 which is a humanized anti-CD33 IgG antibody 
(Protein Design Lab/Kanebo); RITUXAN™ which is a chimeric anti-CD20 IgGl antibody 
QDEC Pharm/Genentech, Roche/Zettyaku); LYMPHOCIDE™ which is a humanized anti- 
CD22 IgG antibody (Immunomedics); ICM3 is a humanized anti-ICAM3 antibody (ICOS 
35 Pharm); LOEC-1 14 is a primatied anti-CD80 antibody (IDEC Phann/Mitsubishi); 

ZEVAHN™ is a radiolabelled murine anti-CD20 antibody (TOEC/Schering AG); DDEC- 
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131 is a humanized anti-CD40L antibody (TDEC/Eisai); IDEC-151 is a primatized anti-CD4 
antibody (DDEC); IDEC-152 is a primatized anti-CD23 antibody (TOEC/Seikagaku); 
SMART anti-CD3 is a humanized anti-CD3 IgG (Protein Design Lab); 5G1.1 is a 
humanized anti-complement factor 5 (C5) antibody (Alexion Pharm); D2E7 is a humanized 

5 anti-TNF-a antibody (CAT/BASF); CDP870 is a humanized anti-TNF-a Fab fragment 
(Celltech); IDEC-151 is a primatized anti-CD4 IgGl antibody (IDEC Phami/SirnthKline 
Beecham); MDX-CD4 is a human anti-CD4 IgG antibody (Medarex/Eisai/Gemnab); 
CDP571 is a humanized anti-TNF-a IgG4 antibody (Celltech); LDP-02 is a humanized anti- 
<x4p7 antibody (LeukoSite/Genentech); OrthoClone OKT4A is a humanized anti-CD4 IgG 

10 antibody (Ortho Biotech); ANTOVA™ is a humanized anti-CD40L IgG antibody (Biogen); 
ANTEGREN™ is a humanized anti-VLA-4 IgG antibody (Elan); and CAT-152 is a human 
anti-TGF-p 2 antibody (Cambridge Ab Tech). 

5.4 PHARMACEUTICAL COMPOSITIONS 

15 The present invention further provides pharmaceutical compositions, formulations, 

dosage units and methods of administration comprising the heterologous proteins produced 
by the transgenic avians using methods of the invneion. Preferably, compositions of the 
invention comprise a prophylactically or therapeutically effective amount of a the 
heterologous protein, and a pharmaceutical^ acceptable carrier. 

20 The term "carrier" refers to a diluent, adjuvant, excipient, or vehicle with which a 

compound of the invention is administered. Such pharmaceutical vehicles can be liquids, 
such as water and oils, including those of petroleum, animal, vegetable or synthetic origin, 
such as peanut oil, soybean oil, mineral oil, sesame oil and the like. The pharmaceutical 
vehicles can be saline, gum acacia, gelatin, starch paste, talc, keratin, colloidal silica, urea, 

25 and the like. In addition, auxiliary, stabilizing, thickening, lubricating and coloring agents 
may be used. When administered to a patient, the compounds of the invention and 
pharmaceutical^ acceptable vehicles are preferably sterile. Water is a preferred vehicle 
when the compound of the invention is administered intravenously. Saline solutions and 
aqueous dextrose and glycerol solutions can also be employed as liquid vehicles, 

30 particularly for injectable solutions. Suitable pharmaceutical vehicles also include 

excipients such as starch, glucose, lactose, sucrose, gelatin, malt, rice, flour, chalk, silica 
gel, sodium stearate, glycerol monostearate, talc, sodium chloride, dried skim milk, 
glycerol, propyieneglycol, water, ethanol and the like. The present compositions, if desired, 
can also contain minor amounts of wetting or emulsifying agents, or pH buffering agents. 

35 The present compositions can take the form of solutions, suspensions, emulsion, 

tablets, pills, pellets, capsules, capsules <x>ntaining liquids, powders, sustained-release 
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formulations, suppositories, emulsions, aerosols, sprays, suspensions, or any other form 
suitable for use. In one embodiment, the phannaceutically acceptable vehicle is a capsule 
(see e.g., U.S. Patent No. 5,698,155). Other examples of suitable pharmaceutical vehicles 
are described in "Remington's Pharmaceutical Sciences" by E.W. Martin. 

5 In a preferred embodiment, the heterologous proteins are formulated in. accordance 

with routine procedures as a pharmaceutical composition adapted for intravenous 
administration to human beings. Typically, compounds of the invention for intravenous 
administration are solutions in sterile isotonic aqueous buffer. Where necessary, the 
. compositions may also include a solubilizing agent. Compositions for intravenous 

10 administration may optionally include a local anesthetic such as lignocaine to ease pain at 
the site of the injection. Generally, the ingredients are supplied either separately or mixed 
together in unit dosage form, for example, as a dry lyophilized powder or water free 
concentrate in a hermetically sealed container such as an ampoule or sachette indicating the 
quantity of active agent. Where the heterologous protein of the invention is to be 

15 administered by infusion, it can be dispensed, for example, with an infusion bottle 
containing sterile pharmaceutical grade water or saline. Where the composition of the 
invention is administered by injection, an ampoule of sterile water for injection or saline can 
be provided so that the ingredients may be mixed prior to adrninistration. 

Compositions for oral delivery may be in the form of tablets, lozenges, aqueous or 

20 oily suspensions, granules, powders, emulsions, capsules, syrups, or elixirs, for example. 
Orally adrninister ed compositions may contain one or more optional agents, for example, 
sweetening agents such as fructose, aspartame or saccharin; flavoring agents such as 
peppermint, oil of wintergreen, or cherry; coloring agents; and preserving agents, to provide 
a phannaceutically palatable preparation. Moreover, where in tablet or pill form, the 

25 compositions may be coated to delay disintegration and absorption in the gastrointestinal 
tract thereby providing a sustained action over an extended period of time. Selectively 
permeable membranes surrounding an osmotically active driving compound are also 
suitable for orally administered compounds of the invention. In these later platforms, fluid 
from the environment surrounding the capsule is imbibed by the driving compound, which 

30 swells to displace the agent or agent composition through an aperture. These delivery 
platforms can provide an essentially zero order delivery profile as opposed to the spiked 
profiles of immediate release formulations. A time delay material such as glycerol 
monostearate or glycerol stearate may also be used Oral compositions can include standard 
vehicles such as mannitol, lactose, starch, magnesium stearate, sodium saccharin, cellulose, 

35 magnesium carbonate, etc. Such vehicles are preferably of pharmaceutical grade. 
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Further, the effect of the heterologous proteins may be delayed or prolonged by 
proper formulation. For example, a slowly soluble pellet of the compound may be prepared 
and incorporated in a tablet or capsule. The technique may be improved by making pellets 
of several different dissolution rates and filling capsules with a mixture of the pellets. 
5 Tablets or capsules may be coated with a film which resists dissolution for a predictable 
period of time. Even the parenteral preparations may be made long-acting, by dissolving or 
suspending the compound in oily or emulsified vehicles which allow it to disperse only 
slowly in the serum. 

10 5.5 TRANSGENIC AVIANS 

Another aspect of the present invention concerns transgenic avians, preferably 
chicken or quail, produced by methods of the invention described in section 5.1 infra, 
preferably by microinjecting a nucleic acid comprising a transgene into an avian embryo by 
the cytoplasmic microinjection methods of the present invention. Following introduction of 

15 the selected nucleic acid into an early stage avian embryo by the methods of the present 
invention, the embryo is transferred into the reproductive tract of a recipient hen. The 
embryo containing the transgene then develops inside the recipient hen and travels through 
the oviduct thereof, where it is encapsulated by natural egg white proteins and a natural egg 
shell. The egg is laid and can be incubated and hatched to produce a transgenic chick. The 

20 resulting transgenic avian chick {i.e, the GO) will cany one or more desired transgene(s) 
some or all of its cells, preferably in its germ line. These GO transgenic avians can be bred 
using methods well known in the art to generate second generation (i.e., Gls) transgenic 
avians that carry the transgene, Le., achieve gerrnline transmission of the transgene. In 
preferred embodiments, the methods of the invention result in gerrnline transmission, Le., 

25 percentage of GOs that transmit the transgene to progeny (Gls), that is greater than 5%, 
preferably, greater than 10%, 20%, 30%, 40%, and, most preferably, greater than 50%, 60%, 
70%, 80%, 90% or even 100%. In other embodiments, the efficiency of transgenesis (Le., 
number of GOs containing the transgene) is greater than 2%, 5%, 10%, 20%, 30%, 40%, 

50%, 60%, 70%, 80% or 99%. 

30 Following maturation, the transgenic avian and/or transgenic progeny thereof; may 

lay eggs containing one or more desired heterologous protein(s) expressed therein and that 
can be easily harvested therefrom. The Gl chicks, when sexually mature, can then be bred 
to produce progeny that are homozygous or heterozygous for the transgene. 

A transgenic avian of the invention may contain at least one transgene, at least two 

35 transgenes, at least 3 transgenes, at least 4 transgenes, at least 5 transgenes, and preferably, 
though optionally, may express the subject nucleic acid encoding a polypeptide in one or 
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more cells Ln the animal, such as the oviduct cells of the chicken. In embodiments of the 
present invention, the expression of the transgene may be restricted to specific subsets of 
cells, tissues, or developmental stages utilizing, for example, cis-acting sequences that 
control expression in the desired pattern. Toward this end, it is contemplated that tissue- 

5 specific regulatory sequences, or tissue-specific promoters, and conditional regulatory 
sequences may be used to control expression of the transgene in certain spatial patterns. 
Moreover, temporal patterns of expression can be provided by, for example, conditional 
recombination systems or prokaryotic transcriptional regulatory sequences. The inclusion 
of a 5' MAR region, and optionally the 3 5 MAR on either end of the sequence, in the 

10 expression cassettes suitable for use in the methods of the present invention may allow the 
heterologous expression unit to escape the chromosomal positional effect (CPE) and 
therefore be expressed at a more uniform level in transgenic tissues that received the 
transgene by a route other than through germ line cells. 

The transgenes may, in certain embodiments, be expressed conditionally, e.g., the 

15 heterologous protein coding sequence is under the control of an inducible promoter, such as 
a prokaryotic promoter or operator that requires a prokaryotic inducer protein to be 
activated. Operators present in prokaryotic cells have been extensively characterized in vivo 
and in vitro and can be readily manipulated to place them in any position upstream from or 
within a gene by standard techniques. Such operators comprise promoter regions and 

20 regions that specifically bind proteins such as activators and repressors. One example is the 
operator region of the lexA gene of E. <:oli to which the LexA polypeptide binds. Other 
exemplary prokaryotic regulatory sequences and the corresponding trans-activating 
prokaryotic proteins are disclosed by Brent and Ptashne in U.S. Patent No. 4,833,080 (the 
contents of which is herein incorporated by reference in its entirety). Transgenic animals 

25 can be created which harbor the subject transgene under transcriptional control of a 

prokaryotic sequence or other activator sequence that is not appreciably activated by avian 
proteins. Breeding of this transgenic animal with another animal that is transgenic for the 
corresponding trans-activator can be used to activate of the expression of the transgene. . 
Moreover, expression of the conditional transgenes can also be induced by gene therapy-like 

30 methods wherein a gene encoding the trans-activating protein, e.g. 9 a recombinase or a 
prokaryotic protein, is delivered to the tissue and caused to be expressed, such as in a cell- 
type specific manner. 

Transactivators in these inducible or repressible transcriptional regulation systems 
are designed to interact specifically with sequences engineered into the transgene. Such 

35 systems include those regulated by tetracycline ("tet systems"), interferon, estrogen, 

ecdysone, Lac operator, progesterone antagonist RU486, and rapamycin (FK506) with tet 
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systems being particularly preferred (see, e.g., Gingrich and Roder, 1998, Annu. Rev. 
Neurosci. 21 : 377-405; incorporated herein by reference in its entirety). These drugs or 
hormones (or their analogs) act on modular transactivators composed of natural or mutant 
ligand-binding domains and intrinsic or extrinsic DNA binding and transcriptional 
5 activation domains. In certain embodiments, expression of the heterologous peptidecan be 
regulated by varying the concentration of the drug or hormone in medium in vitro or in the 

diet of the transgenic animal in vivo. 

In a preferred embodiment, the control elements of the tetracycline-resistance operon 
of E. coli is used as an inducible or repressible transactivator or transcriptional regulation 

10 system ("tet system") for conditional expression of the transgene. A tetracycline-controlled 
transactivator can require either the presence or absence of the antibiotic tetracycline, or one 
of its derivatives, e.g., doxycycline (dox), for binding to the tet operator of the tet system, 
and thus for the activation of the tet system promoter (Ptet). 

In a specific embodiment, a tetracycline-repressed regulatable system (TrRS) is used 

15 (Agha-Mohammadi and Lotze, 2000, J. Clin. Invest. 105(9): 1 177-83; Shockett et aL, 1995, 
Proc. Natl. Acad. Sci. USA 92: 6522-26 and Gossen and Bujard, 1992, Proc. Natl. Acad. 
Sci. USA 89: 5547-5 1 ; incorporated herein by reference in their entireties). 

In another embodiment, a reverse tetracycline-controlled transactivator, e.g., rtTA2 
S-M2, is used. rtTA2 S-M2 transactivator has reduced basal activity in the absence 

20 doxycycline, increased stability in eukaryotic cells, and'increased doxycycline sensitivity 
(Urlinger et aL, 2000, Proc. NatL Acad. Sci. USA 97(14): 7963-68; incorporated herein by 
reference in its entirety). In another embodiment, the tet-repressible system described by 
Wells et aL (1999, Transgenic Res. 8(5): 371-81; incorporated herein by reference in its 
entirety) is used. In one aspect of the embodiment, a single plasmid Tet-repressible system 

25 is used. In another embodiment, the GAL4-UAS system (Ornitz et aL, 1991, Proc. Natl. 
Acad Sci. USA 88:698-702; Rowitch etaL, 1999, L Neuroscience 19(20):8954-8965; 
Wang et aL, 1999, Proc. Natl. Acad Sci. USA 96:8483-8488; Lewandoski, 2001, Nature 
Reviews (Genetics) 2:743-755) or a GAL4-VP16 fusion protein system (Wang et aL, 1999, 
-Proc. Natl. Acad Sci. USA 96:8483-8488) is used 

30 In other embodiments, conditional expression of a transgene is regulated by using a 

recombinase system that is used to turn on or off the gene's expression by recombination in 
the appropriate region of the genome in which the potential drug target gene is inserted. 
The transgene is flanked by recombinase sites, e.g. 9 FRT sites. Such a recombinase system 
can be used to turn on or off expression a transgene (for review of temporal genetic 

35 switches and "tissue scissors 5 ' using recombinases, see Hennighausen & Furth, 1999, Nature 
Biotechnol. 17: 1062-63). Exclusive recombination in a selected cell type may be mediated 
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by use of a site-specific recombinase such as Cre, FLP-wild type (wt), FLP-L or FLPe. 
Recombination may be effected by any art-known method, e.g., the method of Doetschman 
et al (1987, Nature 330: 576-78; incorporated herein by reference in its entirety); the 
method of Thomas et al, (1986, Cell 44: 419-28; incorporated herein by reference in its 

5 entirety); the Cre-loxP recombination system (Sternberg and Hamilton, 1981, J. Mol. Biol. 
150: 467-86; Lakso et al, 1992, Proc. Natl. Acad. Sci. USA 89: 6232-36; which are both 
incorporated herein by reference in their entireties); the FLP recombinase system of 
Saccharomyces cerevisiae (O'Gorman et al, 1991, Science 251: 1351-55); the Cre-loxP- 
tetracycline control switch (Gossen and Bujard, 1992, Proc. Natl. Acad. Sci. USA 89: 5547- 

10 51, incorporated herein by reference in its entirety); and ligand-regulated recombinase 

system (Kellendonk et al, 1999, J. Mol. Biol. 285: 175-82; incorporated herein by reference 
in its entirety). Preferably, the recombinase is highly active, e.g., the Cre-loxP or the FLPe 
system, and has enhanced thermostability (Rodriguez et al, 2000, Nature Genetics 25: 139- 
40; incorporated herein by reference in its entirety). 

1 5 . . 3h a specific embodiment, the ligand-regulated recombinase system of Kellendonk et 
al (1999, J. Mol. Biol. 285: 175-82; incorporated herein by reference in its entirety) can be 
used. In this system, the tigand-binding domain (LBD) of a receptor, e.g., the progesterone 
or estrogen receptor, is fused to the Cre recombinase to increase specificity of the 
recombinase. 

20 In the case of an avian, a heterologous polypeptide or jpolypeptides encoded by the 

transgenic nucleic acid may be secreted into the oviduct lumen of the mature animal and 
deposited as a constituent component of the egg white into eggs laid by the animal. It is 
also contemplated to be within the scope of the present invention for the heterologous 
polypeptides to be produced in the serum of a transgenic avian. 

25 A leaky promoter such as the CMV promoter may be operably linked to a transgene, 

resulting in expression of the transgene in all tissues of the transgenic avian, resulting in 
production of; for example, immunoglobulin polypeptides in the serum. Alternatively, the 
transgene may be operably linked to an avian promoter that may express the transgene in a 
. restricted range of tissues such as, for example, oviduct cells and macrophages so that the 

30 heterologous protein ay be identified in the egg white or the serum of a transgenic avian. 
Transgenic avians produced by the cytoplasmic microinjection method of the present 
invention will have the ability to lay eggs that contain one or more desired heterologous 

protein(s) or variant thereof. 

One embodiment of the present invention, therefore, is a transgenic avian produced 
35 by the cytoplasmic microinj ection methods of the present invention and having a 
heterologous polynucleotide sequence comprising a nucleic acid insert encoding a 
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heterologous polypeptide and operably linked to an avian lysozyme gene expression control 
region, the gene expression control region comprising at least one 5' matrix attachment 
region, an intrinsically curved DNA region, at least one transcription enhancer, a negative 
regulatory element, at least one hormone responsive element, at least one avian CR1 repeat 
5 element, and a proximal lysozyme promoter and signal peptide-encoding region. 

Another embodiment of the present invention provides a transgenic avian further 
comprising a transgene with a lysozyme 3' domain. 

Accordingly, the invention provides transgenic avians produced by methods of the 
invention, preferably by cytoplasmic microinjection as described infra. In preferred 
10 embodiments, the transgenic avian contains a transgene comprising a heterologous peptide 
coding sequence operably linked to a promoter and, in certain embodiments, other 
regulatory elements. In more preferred embodiments, the transgenic avians of the invention 
produce heterologous proteins, preferably in a tissue specific manner, more preferably such 
that they are deposited in the serum and, most preferably, such that the heterologous protein 
15 is deposited into the egg, particularly in the egg white. In preferred embodiments, the 
transgenic avians produce eggs containing greater than 5 fig, 10 [ig, 50 ng, 100 u-g, 250 fig, 
500 fig, or 750 ug, more preferably greater than 1 mg, 2 mg, 5 mg, 10 mg, 20 mg, 50 mg, 
100 mg, 200 mg, 500 mg, 700 mg, 1 gram, 2 grams, 3 grams, 4 grams or 5 grams of the 
heterologous protein. In preferred embodiments, the transgenic avians produce an 
20 immunoglobulin molecule and deposit the immunoglobulin hi the egg or serum of the 

avian, and preferably, the immunoglobulin isolated from the egg or serum specifically binds 
its cognate antigen. The antibody so produced may bind the antigen with the same, greater 
or lesser affinity than the antibody produced in a mammalian cell, such as a myeloma or 
CHO cell. 

25 In specific embodiments, the transgenic avians of the invention were not produced 

or are not progeny of a transgenic ancestor produced using a eukaryotic viral vector, more 
particularly, not a retroviral vector (although, in certain embodiments, the vector may 
contain sequences derived from a eukaryotic viral vector, such as promoters, origins of 
replication, etc.). The transgenic avians of the invention include GO avians, founder 

30 transgenic avians, Gl transgenic avians, avians containing the transgene in the sperm or 
ova, avians mosaic for the transgene and avians containing copies of the transgene in most 
or all of the cells. Contemplated by the invention are transgenic avians in which the 
transgene is episomal. In more preferred embodiments, the transgenic avians have the 
transgene integrated into one or more chromosomes. Chromosomal integration can be 

35 detected using a variety of methods well known in the art, such as, but not limited to, 
Southern blotting, PCR, etc. 
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6. EXAMPLES 

The present invention is further illustrated by the following examples. Each 
example is provided by way of explanation of the invention, and is not intended to be a 
limitation of the invention. In fact, it will be apparent to those skilled in the art that various 

5 modifications, combination, additions, deletions and variations can be made in the present 
invention without departing from the scope or spirit of the invention. For instance, features 
illustrated or described as part of one embodiment can be used in another embodiment to 
yield a still further embodiment. It is intended that the present invention covers such 
modifications, combinations, additions, deletions and variations as come within the scope of 

\0 the appended claims and their equivalents. 

The contents of all references, published patent applications, and patents cited 
throughout the present application are hereby incorporated by reference in their entirety. 

6.1 Example 1 : Cytoplasmic Microinjections 

!5 (a) Preparation ofDNAfor microinjection: The plasmid pAVTJCR-Al 15.93.1.2 

(containing the -12.0 kb lysozyme promoter controlling expression of human interferon 
o2b) was purified with a QIAGEN® Plasmid Maxi Kit (QIAGEN®, Valencia, CA), and 
100 pg of the plasmid were restriction digested with Not\ restriction enzyme. The digested 
DNA \va&phenol/CHCl 3 extracted and ethanol precipitated. Recovered DNA was 

20 resuspended in ImM Tris-HCl (pH 8.0) and O.lmM EDTA, then placed overnight at 4°C. 
- DNA was quantified by spectrophotometry and diluted to the appropriate concentration. 
DNA samples which were bound with the SV40 T antigen nuclear localization signal 
peptide (NLS peptide, amino acid sequence CGGPKKKRKVG (SEQ ID NO: 12)) were 
first resuspended in 0.25 M KC1, and NLS peptide was added to achieve a peptide DNA 

25 molar ratio of 100: 1 (Collas and Alestrom, 1996, Mol Reproa\ Develop. 45: 43 1-438, the 
contents of which are incorporated by reference in its entirety). The DNA samples were 
bound to the SV40 T antigen NLS peptide by incubation for 15 minutes. 

(b) Cytoplasmic microinjections: The germinal disc of the avian egg was positioned 
in, and illuminated by the incident light beam, then the micropipette was moved to a 

30 position whereby the tip of the micropipette was over the area of the germinal disc and 
therefore optimally placed for the insertion of the micropipette into the germinal disc. The 
tip of the micropipette was then pressed onto the vitelline membrane of the avian egg, to a 
depth of about 20 microns below the general plane of the membrane. The vitelline 
membrane resisted penetration by the micropipette and therefore the tip indented the 

35 vitelline membrane without piercing the membrane. The depth of the indentation formed by 
the pressure of the tip of the micropipette on the vitelline membrane can be determined by 
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two methods. The micropipette may be pre-marked about 20 microns from the tip. When 
the mark is about level with the general plane of the membrane, the tip will enter the 
germinal disc once the vitelline membrane is penetrated. The distance for the micropipette 
to be depressed may also be controlled by using the micropipette bevel as reference. In this 
5 method, the injection needle penetrates the vitelline membrane up to a point where only the 
apical end of the opening of the bevel is visible above the vitelline membrane, while the 
remaining of the opening is located inside the germinal disk. The movement of the 
micropipette relative to an avian germinal disc is monitored by the obliquely angled macro 
monitoring unit, comprising a focusable macro lens capable of delivering a focused 
10 magnified image of the avian germinal disc to an electronic camera for display by a 
monitor. The oblique angle of the macro lens shows the depth of movement of the 
micropipette relative to the vitelline membrane and the degree of indentation thereof, more 
distinctly than if a vertical microscope objective is used to monitor the microinjection. 
Pulses of piezo-electric induced oscillations were applied to the micropipette once it was m 
1 5 contact with the indented vitelline membrane. The vibrating tip of the micropipette drills 
through the vitelline membrane. The fluid contents of the micropipette are then injected into 
the germinal disc by positive hydraulic pressure exerted on the lumen and the contents 
therein, by the pressure-regulating system. 

Approximately 100 nanoliters of DNA were injected into a germinal disc of stage 1 
20 White Leghorn embryos obtained two hours after oviposition of the previous egg. DNA 
amounts per injection ranged from 1 nanoliter to 100 nanoliters. 

Injected embryos were surgically transferred to recipient hens via ovum transfer 
according to the method of Christmann et al. (PCT/US01/26723, the contents of which are 
incorporated by reference in its entirety), and hard shell eggs were incubated and hatched 
25 (Olsen andNeher, 1948, J. Exp. Zoo. 109: 355-366). 

6.2 Example 2: PCR Analysis of Chick Blood DNA 

(a) DNA extraction. Whole blood from one-week old chicks was collected with heparinized 
capillary tubes. Red blood cell (RBC) nuclei were released and washed with lysis buffer 

30 solution. DNA's from RBC nuclei were extracted by digestion with proteinase K (lmg/ml) 
and precipitated with ethanol. Purified DNA was resuspended in ImM Tris-HCl (pH 8.0) 
and 0. ImM EDTA and quantitated. 

(b) PCR analysis of chick blood DNA. Genomic DNA samples from one-week old chicks 
were analyzed by PCR using primers LYS05 1 for (5'-TGCATCCTTCAGCACTTC5AG- 

35 30(SEQ ID NO: 13) and IFN-3 (5'-AACTCCTCTTGAGGAAAGCC-3')(SEQ ID NO: 14)). 
This primer set amplifies a 584 bp region of the transgene carried by the pAVTJCR- 
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A115.93.1.2 plasmid. Three hundred nanograms of genomic DNA were added to a 50jul 
reaction mixture (1 X Promega PCR Buffer with 1 .5mM MgCl 2 , 200mM of each dNTP, 
. 5fjM primers) and 1 .25 units of Taq DNA polymerase (Promega). The reaction mixtures 
were heated for 4 minutes at 94°C, and then amplified for 34 cycles at 94°C for 1 min, 60°C 
5 for 1 min and 72 °C for 1 min. The samples were heated in a final cycle for 4 minutes at 
72°C. PCR products were detected on a 0.8% agarose gel with ethidium bromide staining, 
as shown in FIG. 7. 



6.3 Example 3: Human Interferon a2b Expression In Chick Serum 

10 One week after hatching, blood was collected from chicks using heparmized 

capillary tubes. Blood was then added to an equal volume of phosphate buffered saline, 
centrifuged at 200 x g 9 and 100 microliters of the supernatant were assayed by human EFN 
ELISA (PBL Biomedical Laboratories, New Brunswick, New Jersey), as shown in FIGS. 8 
and 9. 



15 



30 



6.4 Example 4: Human Interferon a2b Expression In Egg White of 
Transgenic Hens 



Once hens have reached sexual maturity and began to lay (approximately 22-24 
weeks of age), eggs were collected and the egg whites were assayed by ELISA using human . 
20 IFN ELISA (PBL Biomedical Laboratories, New Brunswick, New Jersey) according to the 
manufacturer's instructions. The results of PCR and ELISA analysis of blood and egg 
white are given in Table 1 below that summarizes results of PCR and ELBA analysis. 
Table 1: Analysis of Transgene Presence and Interferon Expression 
Bird# Nuclear Localization Sex PCR ELISA ELISA 

25 Signal (Blood) (Blood) (egg 

white) 

8305 -NLS M + ' + NA 

8331 F - - + 

8340 -NLS F - - + 

AA123 +NLS F + + NA 

AA61 +NLS M + +" 

AA105 +NLS F + 

AA115 + NLS M + - NA 

-NLS: DNA injected without NLS peptide; + NLS: DNA injected with NLS 

peptide; NA: not applicable. 



35 As shown in Table 1, one bird (#8305) of 69 produced using microinjection of DNA 

without the NLS peptide was positive for both the presence of the transgene and the 
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expression of interferon in the blood. Because this bird is a male, he can be bred to a non- 
transgenic hen to establish germline transmission of the transgene. 

Figs. 8 and 9 demonstrate the expression of human interferon in the blood of #8305, 
as compared to standards. FIG. 7 illustrates the PCR results from the serum of for several 

5 birds, including bird 8305, obtained at different intervals after hatching. As can be seen in 
lanes 4, 5, 1 1, and 12 of FIG. 7, positive signal indicated the presence of the transgene at 
two different collection periods. Other PCR positive bands were seen in birds produced by 
rrncroinjection of DNA covalently linked to the NLS peptide as described above. Table 1 
shows that 4 birds, AA123, AA61, AA105 and AA115, of 43 tested were PCR positive, 

10 ELISA positive or both. Expression levels of human IFN in bird AA6 1 , as compared to 
standards, are also illustrated in FIGS. 8 and 9. PCR-positive male birds can be bred to 
determine germline transmission, and eggs collected from transgenic females to assay for 
IFN expression, as described above, as chicks reach sexual maturity 

6.5 Example 5: Purification and Identification of Human Interferon-a2b from 
Transgenic Eggs 

One hundred eggs were cracked and the egg whites separated from the yolks by 
manual manipulation and pooled. The pooled egg white was solubilized by adding 3 
volumes of deionized water per volume of egg white, followed by adjusting the pH to 5.0 
with the drop-wise addition of IN HC1. The solubilized egg white was clarified by 
20 centrifugation at 3750 g for 20 rrnnutes at 4 °C. 

The solubilized egg white was fractionated by cation exchange chromatography 
using SP-Sepharose HP. Two chromatographic runs were performed, the first in 50 mM 
sodium acetate at pH 5.0, the second in 50 mM sodium acetate at pH 4.0. A commercially 
available EOS A kit specific for human interferon-a was used to identify interferon- 

25 containing fractions. 

The cation-exchange purified material was further purified by hydrophobic 
interaction chromatography on Phenyl-Sepharose, with the interferon fraction eluting after 
the addition of 1M acetic acid, pH 4.5, containing 0.5% triton X-100. 

The results of SDS-PAGE and Western Blot analyses of the products at each step of 
30 the purification procedure are shown inFIGS. 10 and 11 respectively. A peak of interferon 
with a molecular weight of approximately. 22,000 daltons was seen following the 
hydrophobic interaction chromatography step. The purity of the interferon at this stage was 
estimated to be approximately 50%, based on the intensity of staining. 

An analysis of the carbohydrate content of the human IFN-a2b purified from the 
35 transgenic chicken AVI-029 is shown in FIG. 12. Bands 1, 2 and 3 are the unsialyated, 
mono- and disialylated saccharides. Sialic acid linkage is alpha 2-3 to galactose and alpha 

-73- 



WO 03/025146 PCT/US02/29878 

2-6 to N-acetylgalactosamine. The glycosylate of the human IFN-o2b produced by human 
cells is compared to that produced in chicken cells, as shown in FIG. 13. 

6.6 Example 6: Construction of Lysozyme Promoter Plasmids 
The chicken lysozyme gene expression control region was isolated by PCR 
5 amplification. Ligation and reamplification of the fragments thereby obtained yielded a 
contiguous nucleic acid construct comprising the chicken lysozyme gene expression control 
region operably linked to a nucleic acid sequence optimized for codon usage in the chicken 
(SEQ ID NO: 5) and encoding a human interferon o2b polypeptide optimized for expression 
in an avian cell. 

10 White Leghorn Chicken {Gallus gallus) genomic DNA was PCR amplified using the 

primers 5pLMAR2 (SEQ ID NO: 1) and LE-6.1kbrevl (SEQ ID NO: 2) in a first reaction, 
and Lys-6.1 (SEQ ID NO: 3) and LysElrev (SEQ ID NO: 4) as primers in a second reaction. 
PCR cycling steps were: denaturation at 94°C for 1 minute; annealing at 60 °C for 1 minute; 
extension at 72 °C for 6 minutes, for 30 cycles using TAQ PLUS PRECISION DNA 

15 polymerase (STRATAGENE®, LaJolla, CA). The PCR products from these two reactions 
were gel purified, and then united in a third PCR reaction using only 5pLMAR2 (SEQ ID 
NO: 1) and LysElrev (SEQ ID NO: 4) as primers and a 10-minute extension period. The 
resulting DNA product was phosphorylated, gel-purified, and cloned into the £coR V 
restriction site of the vector pBluescript® KS, resulting in the plasmid pl2.0-lys. 

20 P 12.0-lys was used as a template in a PCR reaction with primers 5pLMAR2 (SEQ 

IDNO:l)andLYSBSU 

(5'-CCCCCCCCTAAGGCAGCCAGGGGCAGGAAGCAAA-3') (SEQ ID NO: 5) and a 10 
minute extension time. The resulting DNA was phosphorylated, gel-purified, and cloned 
into the EcoR V restriction site of pBluescript® KS, forming plasmid P 12.01ys-B. 

25 pl2.01ys-B was restriction digested with Not I and Bsu36 L gel-purified, and cloned 

into Not I and 2fcu36 1 digested pCMV-LysSPIFNMM, resulting in pl2.0-lys-LSPIFNMM. 
pl2.0-lys-LSPIFNMM was digested with Sal I and the SalltoNotI primer (5'- 
TCGAGCGGCCGC-3 1 ) (SEQ ID NO: 16) was annealed to the digested plasmid, rollowed 
by Not I digestion. The resulting 12.5 kb Not I fragment, comprising the lysozyme promoter 

30 region linked to IFNMAGMAX-encoding region and an SV40 polyadenylation signal 
sequence, was gel-purified and ligated to Not I cleaved and dephosphorylated 
PBLUESCRIPT® KS, thereby forming the plasmid pAVUCR-Al 15.93. 1 .2, which was then 
sequenced. 



35 
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6.7 Example 7: Construction of Plasmids Which Contain the 3* Lysozyme 

Domain 

The plasmid p AVUCR-A1 1 5 .93 . 1 2 was restriction digested with Fsel and blunt- 
ended with T4 DNA polymerase. The linearized, blunt-ended pAVUCR-Al 15.93. 1.2 

5 plasmid was then digested with Xlwl restriction enzyme, followed by treatment with 
alkaline phosphatase. The resulting 15.4 kb DNA band containing the lysozyme 5' matrix 
attachment region (MAR) and -12.0 kb lysozyme promoter driving expression of a human 
interferon was gel purified by electroelution. 

The plasmid plffilys was restriction digested with MM, then blunt-ended with the 

10 Klenow fragment of DNA polymerase. The linearized, blunt-ended plffilys plasmid was 
digested with Xhol restriction enzyme and the resulting 6 kb band containing the 3' 
lysozyme domain from exon 3 to the 3' end of the 3' MAR was gel purified by 
electroelution. The 15.4 kb band from pAVUCR-Al 15.93.1.2 and the 6 kb band from 
plffilys were ligated with T4 DNA ligase and transformed into STBL4 cells (Invitrogen Life 

!5 Technologies, Carlsbad, CA) by electroporation. The resulting 21.3 kb plasmids from two 
different bacterial colonies were named pAVUCR-A212.89.2.1 and pAVUCR-A212.89.23 
respectively. 

6.8 Example 8: Transfection of Chicken HD11 Cells with pAVIJCR- 

A212.89.2.1 and pAVIJCR-A212.89.2.3 

20 Chicken cells transfected with plasmids having the 3' lysozyme domain linked to a 

nucleic acid expressing human a2b interferon express the heterologous polypeptide. 
Chicken myelomonocytic HD11 cells were transfected with plasmid pA VTJCR-A212.89.2. 1 
and pAVUCR-A2 12.89J2.3 to test the functionality of the plasmids. One million HD1 1 
cells were plated per each well of a 24-well dish. The next day, HD1 1 cells were transfected 

25 with 1 ug of plasmid DNA per 4 ul of LIP OFECT AMINE 2000 (Invitrogen Life 

Technologies). For comparison, independent wells were also transfected with the parent 
vector pAVHCR-Al 15.93.1 2. After 5 hours of transfection, the cell medium was changed 
with fresh medium. 48 hours later, cell medium was harvested by centrifugation at 1 1 0 x g 
for 5 min and assayed for human interferon by ELBA (PBL Biomedicals, Flanders, NT). 

30 The transfected cells expressed the heterologous human a2b interferon at least to the 

level seen with a plasmid not having the 3' lysozyme domain operably linked to the human 

a2b interferon encoding nucleic acid 

63 Example 9: Cytoplasmic Microelectroporation 
The application of electrical current has been shown to enhance the uptake of 
3 5 exogenous DNA fragments by cultured cells. The DNA fragments will be injected into the 

genninal disk according to the above-described methods. Enhancement of nuclear uptake 
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of the heterologous DNA will promote earlier chromosomal integration of the exogenous 
DNA molecules, thus reducing the degree of genetic mosaicism observed in transgenic 
avian founders. 

A sample of nucleic acid will be microinjected into the cytoplasm of a recipient 

5 stage 1 avian cell, and delivered to a recipient cell nucleus by microelectroporation. In a 
system suitable for use in microelectroporating early stage avian cells, a cathode will be 
located within the lumen of the DNA delivery micropipette. Another possible location for 
the electrode is on the exterior surface of the micropipette. For either option, the electrode 
is situated close or adjacent to the exit orifice of the pipette so that the electrode and the 

10 micropipette may be introduced into the recipient cell together. Alternatively, the 
micropipette will be introduced into the cytoplasm and used to guide a cathode to make 
electrical contact with the cytoplasm of the targeted cell. 

The placement of the anode is optional. In one arrangement of the electrodes of the 
microelectroporation system, the anode is located on the micropipette and, therefore, will 

1 5 enter the cell or cells with the micropipette and the cathode. In another arrangement, an 
anode is in electrical contact with the Ringers solution that will surround the targeted 
recipient early stage avian cell. In yet another version, the anode is individually positioned 
within the cytoplasm, or the nucleus, of the recipient stage 1 cell. The anode and cathode 
are electrically connected to an electrical impulse generator capable of delivering a timed , 

20 electrical pulse to the electrodes. One suitable apparatus for generating a timed electrical 
pulse according to the present invention is a Ration Scientific Iontaphorsis pump BAB-500. 

A solution of a selected nucleic acid will be microinjected through the inserted 
micropipette into the recipient cell according to the protocols described in the examples 
above. The recipient cell will be pulsed at least once with about 0. 1 to about 20.0 

25 microamps for about 0.1 to about 60 sees. 

This novel intracellular DNA microelectroporation method will enhance the 
efficiency of transgenesis, increase the efficiency of chromosomal integration of 
heterologous transgenic DNA, and reduce mosaicism of the transgenic founder animal by 
ensuring that more recipient cells receive and incorporate the nucleic acid at each delivery 

30 to a cell than is the case with non-electroporated microinjection. 

6.10 Example 10: Construction of an ALV-based Vector Having p-lactamase 
Encoding Sequences 

The lacZ gene ofpNLB, a replication-deficient avian leukosis virus (ALV>based 
35 vector (Cosset et aL, J. Virol. 65: 3388-94 (1991)), was replaced with an expression cassette 
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consisting of a cytomegalovirus (CMV) promoter and the reporter gene B-lactamase (B-La 
or BL). 

To efficiently replace the lacZ gene ofpNLB with a transgene, an intermediate 
adaptor plasmid was first created, pNLB-Adapter. pNLB-Adapter was created by inserting 

5 the chewed back ApaVApal fragment of pNLB (Cosset et aL, 1991, /. Virol. 65:3388-94) 
(in pNLB, the 5' Apal sites reside 289 bp upstream of lacZ and the 3' Apal sites reside 3' of 
the 3' LTR and Gag segments) into the chewed-back KptiilSacl sites of PBLUESCRIPT®KS(- 
). The filled-in MluVXbal fragment of pCMV-BL (Moore et aL, Anal. Biochem. 247: 203-9 
(1997)) was inserted into the chewed-back KpnVNdel sites of pNLB- Adapter, replacing 

10 lacZ with the CMV promoter and the BL gene (in pNLB, Kpnl resides 67 bp upstream of 
lacZ and Ndel resides 100 bp upstream of the lacZ stop codon), thereby creating pNLB- 
Adapter-CMV-BL. To create pNLB-CMV-BL, the HintiBUBlpl insert of pNLB (containing 
lacZ) was replaced with the HindWBlpl insert of pNLB- Adapter-CMV-BL. This two step 
cloning was necessary because direct ligation of blunt-ended fragments into the Hin<SWBlpl 

15 sites of pNLB yielded mostly rearranged subclones, for unknown reasons. 

6.11 Example 11 : Production of Transduction Particles Having an ALV- 
based Vector Having p-lactamase Encoding Sequences 

Sentas and Isoldes were cultured in F10 (GIBCO®), 5% newborn calf serum 
20 (GIBCO®), 1% chicken serum (GIBCO®), 50 ug/ml phleomycin (Cayla Laboratories) and 
50 ug/ml hygromycin (SIGMA®). Transduction particles were produced as described in 
Cosset et aL, 1991, herein incorporated by reference, with the following exceptions. Two 
days after transfection of the retroviral vector pNLB-CMV-BL (from Example 10, above) 
into 9 x10 s Sentas, virus was harvested in fresh media for 6-16 hours and filtered. All of 
25 the media was used to transduce 3 x 10 6 Isoldes in three 100 mm plates with polybrene 
added to a final concentration of 4 ug/ml. The following day the media was replaced with 
media containing 50 ug/ml phleomycin, 50 ug/ml hygromycin and 200 *ig/ml G418 
(SIGMA®). After 10-12 days, single G418 r colonies were isolated and transferred to 24- 
well plates. After 7-10 days, titers from each colony was determined by transduction of 
30 Sentas followed by G418 selection- Typically 2 out of 60 colonies gave titers at 1-3 x 10 5 . 
Those colonies were expanded and the virus concentrated to 2-7 x 10 7 as described in 
Allioli et aL, 1994, Dev. Biol. 165:30-7, herein incorporated by reference. The integrity of 
the CMV-BL expression cassette was confirmed by assaying for p-lactamase in the media o 
cells transduced with NLB-CMV-BL transduction particles. 
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6.12 Example 12: Production of Chickens Transgenic for p-lactamase 
Stage X embryos in freshly laid eggs were transduced with NLB-CMV-BL 

transduction particles (from Example 11, above) as described in Thoraval et aL 9 1995, 
Transgenic Res. 4:369-377, herein incorporated by reference, except that the eggshell hole 

5 was covered with 1-2 layers of eggshell membrane and, once dry, DUCO® model cement. 

Approximately 120 White Leghorns were produced by transduction of the stage X 
embryos with NLB-CMV-BL transduction particles. These birds constitute chimeric 
founders, not fully transgenic birds. Extensive analysis of DNA in the blood and sperm 
from the transduced chickens indicates that 10-20% of the birds had detectable levels of the 

10 transgene in any given tissue. Of those birds carrying the transgene, approximately 2-15% 
of the cells in any given tissue were actually transgenic. 

6.13 Example 13: p-lactamase Activity Assay in Blood and Egg White 

When hens produced in Example 12, above, began to lay eggs, the egg whites of 
15 those eggs were assayed for the presence of P-lactamase. The P-lactamase assay was carried 
out as described in Moore et aL 9 1997, Anal. Biochenu 247:203-9, herein incorporated by 
reference, with the following modifications. 

To assay blood from two to ten day old chicks, the leg vein was pricked with a 
scalpel. 50 ul of blood was collected in a heparinized capillary tube (Fisher), of which 25 

* ■ 

20 |d was transferred to 100 pi phosphate-buffered saline (PBS) in a 96-welI plate. Various 
dilutions of purified p-lactamase (CALBIOCHEM®) was added to some wells prior to 
addition of blood from control (non-transduced) chicks to establish a p-lactamase standard 
curve. After one day at 4°C, the plate was centrifuged for 10 minutes at 730 x g. 25 ul of 
the supernatant was added to 75 ul of PBS. 100 ul of 20 jxM 7-(thienyi-2-acetamidd)-3-[2- 

25 (4-N,N-dimethylammophen acid 

(PADAC,from CALBIOCHEM®) in PBS was added, and the wells were read immediately 
on a plate reader in a 10 minute kinetic read at 560 run or left overnight in the dark at room 
temperature. Wells were scored positive if the well had turned from purple to yellow. To 
assay blood from older birds, the same procedure was followed except that 200-300 ul 

30 blood was drawn from the wing vein using a syringe primed with 50 ul of heparin 
(SIGMA®). 

Analysis of the NLB-CMV-BL transduced flock revealed nine chickens that had 
significant levels of P-lactamase in their blood. Three of these chickens were males and 
these were the only three males that had significant levels of the NLB-CMV-BL transgene 
35 in their sperm as determined by PCR analysis. Thus, these are the males to be outbred to 
obtain fully transgenic G, offspring. The other six chickens were the hens that expressed p- 
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lactamase in their magnum tissue (see below). Other birds had low levels of p-lactamase 
(just above the level of detection) in their blood but did not have transgenic sperm or eggs 
containing P-lactamase. Thus p-lactamase expression in blood is a strong indicator, of 
whether a chicken was successfully transduced. 
5 To assay p-lactamase in egg white, freshly laid eggs were transferred that day to a 

4°C cooler, at which point the p-lactamase is stable for at least one month. (Bacterially- 
expressed, purified p-lactamase added to egg white was determined to lose niinimal activity 
over several weeks at 4°C, confirming the stability of p-lactamase in egg white.) To collect 
egg white samples, eggs were cracked onto plastic wrap. The egg white was pipetted up 
10 and down several times to mix the thick and thin egg whites. A sample of the egg white 
was transferred to a 96-well plate. 10 ul of the egg white sample was transferred to a 96- 
well plate containing 100 ul of PBS supplemented with 1.5 ul of 1 M NaH 2 P0 4 , pH 5.5 per 
well. After addition of 100 ul of 20 uM PAD AC, the wells were read immediately on a 
plate reader in a 10 minute or 12 hour kinetic read at 560 nm. Various dilutions of purified 
! 5 p-lactamase was added to some wells along with 10 pi of egg white from control (non- 
transduced) hens to establish a p-lactamase standard curve. Egg white from both untreated 
and NLB-CMV-BL transduced hens were assayed for the presence of p-lactamase. 

Significant levels of p-lactamase were detected in the egg white of six hens, as 
shown in Table 2, below. Eggs laid by Hen 1 522, the first hen to demonstrate expression in 
20 eggs, have 0.3 mg or higher of active p-lactamase per egg. Also shown is p-lactamase 
production from three other NLB-CMV-BL transduced hens (Hen 1549, Hen 1790 and Hen 
1593). Every hen mat laid eggs containing p-lactamase also had significant levels of p- 

lactamase in its blood. 

Table 2: Tfrmression of B-lactamase in eggs of NLB-CMV-BL treated hens. 



25 



30 



Hen# 



Average mg of p-lactamase per egg 




# of eggs assayed 



35 



Controls were eggs from untreated hens. The low level of BL in these eggs was due 
to spontaneous breakdown of PADAC during the course of the kinetic assay. The other 
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hens were transduced with NLB-CMV-BL as described in Example 3. Egg white from each 

egg was assayed in triplicate. 

Based on the p-lactamase activity assay, the expression levels ofp-lactamase 
appeared to range from 0.1 to 1.3 mg per egg (assuming 40 milliliters of egg white per egg). 

5 However, these assay quantities were significantly less than the quantities obtained by 
western blot assay and were determined to be deceptively lower than the true values. The 
difference in results between the enzymatic activity assay and a western blot analysis was 
due to a p-lactamase inhibitor in egg white. The activity of purified p-lactamase was 
inhibited by egg white such that 50 ul of egg white in a 200 pi reaction resulted in nearly 

10 100% inhibition, whereas 10 pi of egg white in a 200 pi reaction resulted in only moderate 
inhibition. Furthermore, spontaneous breakdown of the enzymatic substrate, PADAC, 
during the course of the assay also contributed to the erroneously low calculation of p- 
lactamase concentration. 

614 Example 14: Isolation and Ex Vivo Transfection of Blastodermal Cells 
Donor blastodermal cells are isolated from fertilized eggs of Barred Plymouth Rock 
hens using a sterile annular ring of Whatman filter paper which is placed over a blastoderm 
and lifted after cutting through the yolk membrane of the ring. The ring bearing the 
attached blastodenn is transferred to phosphate-buffered saline (PBS) in a petri dish ventral 
20 side up, and adhering yolk is removed by gentle pipetting. The area opaca is dissected away 
with a hair loop and the translucent stage X blastodenn is transferred via a large-bore 
pipette tip to a micxomge tube. About 30,0<HM0,000 cells are isolated per blastodenn and . 
for a typical experiment 10 blastoderms are collected. 

Cells are dispersed by brief trypsin (0.2%) digestion, washed once by low speed 
25 centrifugation in Dulbecco's modified Eagle's medium (DMEM) and then transfected with 
linearized plasmids via lipofectin (16 mg/200 ml, BRL) for 3 hours at room temperature. 

i iv— . <innj;nn ncWs ztp. fniected into S- 



15 













mm 




r 



(AC line). Injection is through a small window (-0.5 cm) into the subgenninal cavity 
30 beneath the recipient blastoderms. Windows are sealed with fresh egg shell membrane and 
DUCO® plastic cement. Eggs are then incubated at 39.1° C in a humidified incubator with 
90° rotation every 2 hr. 

6.15 Example 15: Identification of Transgenic Mosaics by PCR Assay 
35 Among the chicks which hatch from embryos containing transfected or transduced 

blastodermal cells, only those exhibiting Barred Plymouth Rock feather mosaicism are 
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retained. Even if no reporter gene is present in the transgene, transgenic mosaics can be 

identified by PGR assay. 

To identify transgenic mosaics, DNA blood and black feather pulp of individual 
chicks are assayed by PCR for the presence of the transgene using a primer pair specific to 
5 the transgene as described by Love et aU 1 994, Bio/Technology 12:60-63. Transgene 
chimeras are induced, withdrawn and re-induced with diethylstilbestrol (DES) pellets and 
excised magnums analyzed for expression of reporter activity. Blood and fiver are assayed 

to monitor tissue specificity. 

Male and female blood DNA was collected at 10 to 20 days post-hatch. Blood is 

10 drawn from a wing vein into a heparinized syringe and one drop is immediately dispensed 
into one well of a flat-bottom 96-well dish containing a buffer which lyses cytoplasmic 
membranes exclusively. The plate is then briefly centrifuged, which pellets the nuclei. The 
supernatant is removed and a second lysis buffer is added which releases genomic DNA 
from nuclei and degrades nucleases. The DNA is ethanol precipitated in the plate, washed 

15 with 70% ethanol, dried and resuspended in 100 fA of water per well. As much as 80 Mg of 
DNA suitable for PCR and TAQMAN™ (Perkin Elmer/Applied Biosystems) analysis can 
be obtained from one drop (8 fA) of chick blood. 

The isolated DNA is tested for the presence of the transgenes using the TAQMAN® 
sequence detection assay to evaluate the efficiency of the embryo transduction process. The 

20 TAQMAN® sequence detection system allows the direct detection of a specific sequence. 
A fluorescently-labeled oligonucleotide probe complementary to an internal region of a 
desired PCR product only fluoresces when annealed to the desired PCR product, which in 
this case is complementary to the transgene. Because all of the detection occurs in the PCR 
tube during the cycling process, the TAQMAN® system allows high-throughput PCR (no 

25 gel electrophoresis is need) as well as sequence detection analogous to and as sensitive as 
Southern analysis. 1 fA of the isolated DNA, which contains 600-800 ng of DNA, is used 
for the TAQMAN® reaction. Each reaction contains two sets of primer pairs and 
TAQMAN® probes. The first set detects the chicken glyceraldehyde 3-phosphate 
dehydrogenase gene (GAPDH) and is used as an internal control for the quality of the 

30 genomic DNA and also serves as a standard for quantitation of the transgene dosage. The 
second set is specific for the desired transgene. Fluorescence is detected in a dissecting 
stereomicroscope equipped with epifluorescence detection. The two TAQMAN® probes 
are attached to different dyes that fluoresce at unique wavelengths: thus both PCR products 
are detected simultaneously in an ABI/PE 7700 Sequence Detector. It is estimated that up to 

35 180 birds will hatch, and 20% (36 birds) will contain the transgene in their blood. 
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6.16 Example 16: Production of Fully Transgenic G, Chickens Expressing p- 
Lactamase 

Males are selected for breeding as a single male can give rise to 20 to 30 G^ 
offspring per week as opposed to 6 G, offspring per female per week, thereby speeding the 

5 expansion of G, transgenics. The feed of G 0 males is supplemented with sulfamethazine, 
which accelerates the sexual maturation of males such that they can start producing sperm at 
10-12 weeks of age instead of 20-22 weeks without influencing their health or fertility. 

Sperm DNA of all males are screened for the presence of the transgene. Sperm are 
collected and the DNA extracted using Chelex-100. Briefly, 3 fA of sperm and 200 fA of 

10 5% Chelex-100 are mixed, followed by addition of 2 fA of 10 mg/ml proteinase K and 7 fA 
of2MDTT. Samples are incubated at 56°C for 30-60 minutes. Samples are boiled for 
8 minutes and vortexed vigorously for 10 seconds. After centrifugation at 10 to 15 kG for 
2-3 minutes, the supernatant is ready for PCR or TAQMAN® analysis. The DNAs are 
analyzed by the TAQMAN® assay using a TAQMAN® probe and primers complementary 

15 to the transgene. Of the 90 G 0 males, it is estimated that 5%, or 4 to 5, will have the 

transgene in their sperm DNA. 

As noted above in Example 13, the NLB-CMV-BL transduced flock included three 
males that had significant levels of the NLB-CMV-BL transgene in their sperm as 
, determined by PCR analysis. Thus, these males 37s chosen for further breeding to obtain 

20 fully transgenic G x offspring. 

By breeding germline transgenic males to 90 non-transgenic White Leghorn females 
per week, about 16 G x offspring per week will be obtained. Hatched chicks are vent-sexed 
and screened for the presence of the transgene in their blood DNA by the TAQMAN® 
assay. Twenty male and female G, transgenics will be obtained or 40 total, which will take 

25 up to 3 weeks. 

Males will be kept for further breeding and females tested for expression of 
transgenes in the egg. 



30 



6.17 Example 17: pNLB-CMV-IFN Vector Having an IFN Encoding 
Sequence 



The DNA sequence for human interferon a2b based on hen oviduct optimized codon 
usage was created using the BACKTRANSLATE program of the Wisconsin Package, 
version 9.1 (Genetics Computer Group. Inc., Madison, WI) with a codon usage table 
compiled from the chicken (Gallus gallus) ovalbumin, lysozyme, ovomucoid, and 
35 ovotransferrin proteins. The template and primer oligonucleotides (SEQ ID NOS: 17-34) 
shown in Fig. 15A-B were amplified by PCR with Pfu polymerase (STRATAGENE®, La 
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Jolla, CA) using 20 cycles of 94°C for 1 min., 50°C for 30 sec, and 72°C for 1 min. and 10 
sec. 

PCR products were purified from a 12% polyacrylamide-TBE gel by the "crush and 
soak" method (Maniatis et al 1982), then combined as templates in an amplification 

5 reaction using only EFN-1 (SEQ ID NO: 24) and IFN-8 (SEQ ID NO: 34) as primers. The 
resulting PCR product was digested with Hind III and Xba I and gel purified from a 2% 
agarose-TAE gel, then ligated into Hind m and Xba I digested, alkaline phosphatase-treated, 
PBLUESCRIPT® KS (STRATAGENE®), resulting in the plasmid pBluKSP-IFNMagMax. 
Both strands were sequenced by cycle sequencing on an ABI PRISM 377 DNA Sequencer 

10 (Perkin-Elmer, Foster City, CA) using universal T7 or T3 primers. Mutations in pBluKSP- 
IFN derived from the original oligonucleotide templates were corrected by site-directed 
mutagenesis with the Transformer Site-Directed Mutagenesis Kit (Clontech, Palo Alto, 
CA). The interferon coding sequence was then removed from the corrected pBluKSP-IFN 
with Hind JR and Xba 1, purified from a 0.8% agarose-TAE Gel, and ligated to Hind HI and 

15 Xba I digested, alkaline phosphatase-treated pCMV-BetaLa-3B-dH. The resulting plasmid 
was pCMV-IFN which contained IFN coding sequence controlled by the cytomegalovirus 
immediate early promoter/enhancer and S V40 polyA site. 

To clone the IFN coding sequence controlled by the CMV promoter/enhancer into 
the NLB retroviral plasmid, pCMV-IFN was first digested with CM sn&Xbal, then both 

20 ends were filled in with Klenow fragment of DNA polymerase (New England BioLabs, 
Beverly, MA). pNLB-adapter was digested with Nde I and Kpn I, and both ends were made 
blunt by T4 DNA polymerase (New England BioLabs). Appropriate DNA fragments were 
purified on a 0.8% agarose-TAE gel, then ligated and transformed into DH5a cells. The 
resulting plasmid was pNLB-adapter-CMV-IFN. 

25 This plasmid was then digested with Mlu I and partially digested with Blp I and the 

appropriate fragment was gel purified. pNLB-CMV-EGFP was digested with Mlu I and Blp 
\ then alkaline-phosphatase treated and gel purified. The Mlu VBlp I partial fragment of 
pNLB-adapter-CMV-IFN was ligated to the large fragment derived from the Mlu VBlp I 
digest of pNLB-CMV -EGFP, creating pNLB-CMV-IFN. 

30 

6.18 Example 18: Production of pNLB-CMV-BFN Transduction Particles 

Senta packaging cells (Cosset et al, 1991) were plated at a density of 3 x 10 5 
cells/35 mm tissue culture dish in F-10 medium (life Technologies) supplemented with 
50% calf serum (Atlanta Biologicals), 1% chicken senim (Life Technologies), 50 ng/ml 
35 hygromycin (SIGMA®), and 50 jig/ml phleomycin (CAYLA, Toulouse, France). These 
cells were transfected 24h after plating with 2 \ig of CsCl-purified pNLB-CMV-IFN DNA 
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and 6 p.1 of Lipofectin liposomes (Life Technologies) in a final volume of 500 ul Optimem 
(Life Technologies). The plates were gently rocked for four hours at 37° C in a 5% C0 2 
incubator. For each well, the media was removed, washed, once with 1 ml of Optimem and 
re-fed with 2 mis of F-10 medium supplemented with 50% calf serum, 1% chicken serum, 

5 50 ^g/ml hygromycin, and 50 /xg/ml phleomycin. The next day, medium from transfected 
Sentas was recovered and filtered through a 0.45 micron filter. 

This medium was then used to transduce Isolde cells. 0.3 ml of the filtered medium 
recovered from Senta cells was added to 9.6 ml of F-10 (Life Technologies) supplemented 
as described above, in addition to polybrene (SIGMA®) at a final concentration of 4 ug/ml. 

10 This mixture was added to 10 6 Isolde packaging cells (Cosset et al. 9 1991) plated on a 

100mm dish the previous day, then replaced with fresh F-10 medium (as described for Senta 
growth) 4 hours later. 

The next day, the medium was replaced with fresh medium which also contained 
200 ug/ml neomycin (G418, SIGMA®). Every other day, the medium was replaced with 

15 fresh F-10 medium supplemented with 50% calf serum, 1% chicken serum, 50 Mg/ml 
hygromycin, 50 //g/ml phleomycin, and 200 pig/ml neomycin. Eleven to twelve days later, 
single colonies were visible by eye, and these were picked and placed into 24 well dishes. 
When some of the 24 well dishes became confluent, medium was harvested and titered to 
determine the cell lines with the highest production of retrovirus. 

20 Titering was performed by plating 7.5 x 10 4 Senta cells per well in 24 well plates on 

the day prior to viral harvest and transduction. The next day 1 ml of fresh F-10 medium 
supplemented with 50% calf serum, 1% chicken serum, 50 jug/ml hygromycin, and 50 
pig/ml phleomycin was added to each well of the isolated Isolde colonies. Virus was 
harvested for 8-10 hours. The relative density of each well of Isoldes was noted. After 8-10 

25 hours, 2 and 20 {A of media from each well of Isoldes was added directly to the media of 
duplicate wills of the Sentas. Harvested medium was also tested for the presence of 
interferon by IFN ELBA and for interferon bioreactivity. The next day the media was 
replaced with F-10 medium supplemented with 50% calf serum, 1% chicken serum, 50 
/xg/ml hygromycin, 50 /zg/ml phleomycin, and 200 ^g/rnl neomycin. When obvious 

30 neomycin-resistant colonies were evident in the wells of transduced Sentas, the number of 
colonies was counted for each well. 

The Isolde colony producing the highest titer was determined by taking into account 
the number of colonies and correcting for the density of the Isolde cells when the viral 
particles were harvested (i.e., if two Isolde colonies gave rise to media with the same titer, 

35 but one was at a 5% density and the other was at a 50% density at the time of viral harvest, 
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m at the 5% density was chosen for farther work, as was the case in the present 

example). « . « 

The Isolde cell line producing the highest titer of IFN-encoding transducmg parties 

was scaled up to six T-75 tissue culture flasks. When flasks were confluent, cells were 
, washed with F-10 medium (unsupplemented) and transducing particles were 

for 16 hours in 14 ml/flask of F-10 containing 1% calf serum (Atlanta BiologLcals) and 0.2/. 
chicken serum (life Technolocyies). Medium was harvested, filtered through a 0.45 micron 
syringe filter, then centrifuged at 195,000xg in a Beckman 60Ti rotor for 35 mm Lxqmd 
^removed except for 1 ml, and this was incubated with the pellet at 37'C with gentle 
10 shaking for one hour. Aliquots were frozen at -70°C. Transducing particles were then 
titered on Senta ceUs to determine concentrations used to inject embryos. 

6.19 Example 19: Production of Chimeric Transgenic Chickens 

Approximately 300 White Leghorn (strain Line 0) eggs were windowed according to 
15 the Speksnijder procedure described in U.S. Patent No. 5,897,998, incorporated heremby 
reference in its entirety, then injected with about 7 x Iff transducing parties P-^Eggs 
hatched 21 days after injection and human interferon levels were measured by IFN ELISA 
from serum samples collected from chicks one week after hatch. 

* 20 Examine 20- Production of Fully Transgenic G, Chickens for Selective 
6.20 E^P^^f,^ Males Expressing Human In« rferon 

To screen for G 0 roosters which contained the interferon transgene in their sperm, 

DNA was extracted from rooster sperm samples by Chelex-100 extraction (Walsh */ al, 

icon DNA samples were then subjected to TAQMAN® analysis on a 7700 Sequence 
1991). DMA samples j TGGATTGCACGCAG GTTCT-3') (SEQ 

75 Detector (Perkin Elmer) usmg the 'neo for-1 p -luuAnuv,n _, Tr . 
ID NO- 35) and «neo rev-1" (5 '-GTGCCCAGTC ATAGCCGAAT-3 ') (SEQ ID NO: 36) 
XI IdFAMlabeledNEO-PROBEl (5'-CCTCTCCACCCAAGCGGCCG-3') (SEQ 

©NO: 37) to detect me tr^^^ ^ 
transgene in their sperm samples were bred to nontransgenic SPAFAS (White Leghorn) 

30 hens by artificial inseminatioiL 

Blood DNA samples were screened for the presence of the transgene by 

, . j -u-a T? v *mnle 14 above. Out of 1,597 offspring, one 
TAQMAN® analysis as described in Example I**, aouvc. 

rooster was found to be transgenic (a.ka. «A]phie"). Alpbie's serum was tested for the 
presence of human interferon by bJFN ELISA. hlFN was present at 200 nanograms/ml. 

Alphie's sperm was used for artificial insemination of nontransgenic SPAFAS 
(White Leghorn) hens. To date, 106 out of 202 (about 52%) offspring contain the transgene 
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,o hv T A OMAN® analysis. These breeding results follow a Mendelian inheritance 

pattern and indicate that Alphie is transgenic. 

■ ■ . > * 

6.21 Example 21: Production of Human Interferon «2b in the Egg White of G, 

Transgenic Hens 

Human lung carcinoma cells were incubated with diluted egg white samples, then 
washed and challenged with mengovirus. After incubation, cells were stained with crystal 

violet to assess viral interference. 

Expression levels of human JEN a2b in egg white produced by G 2 hens as 
10 detenmned by ELISA are shown in Fig. 16. The bioactivity versus the mass of human IFN 
a2b produced in G 2 hen egg white is shown in Fig. 17. Bioactivity was detenmned by a 
viral inhibition assay, and mass was determined by IFN ELISA. Bird number 53 was a 
control bird and represented egg white from a non-transgenic hen. 

15 6.22 Example 22: Transection of Cultured Quail Oviduct Cells 

The oviduct was removed from a Japanese quail (Cotumix coturnix japomca) and 
the magnum portion was minced and enzymatically dissociated with 0.8 mg/ml collagenase 
(SIGMA® Chemical Co., St. Louis, MO) and 1.0 mg/ml dispase (ROCHE® Molecular 
Biochenucals, Indianapolis, IN) by shaking and uturating for 30 min at 37°C. The cell 
20 suspension was then filtered through sterile surgical gauze, washed three times with F-12 
medium (Life Technologies, Grand Island, NY) by centrifugation at 200 x g , ana 
resuspended in OPTJMEM™ (Life Technologies) such that the aD„ was approximately 2. 
300 ul of cell suspension was plated per well of a 24-well dish. For each transfecuon, 2.5 
pi of DMR1E-C liposomes (Life Technologies) and 1 ug of DNA were preincubated 15 
25 minutes at room temperature in 100 ul of OPTIMEM™ then added to the oviduct cells. 
Cells with DNA/liposomes were incubated for 5 hours at 37°C in 5% CO, Next, 0.75 ml 
of DMEM (Life Technologies) supplemented with 1 5% fetal bovine serum (FBS) (Atlanta 
Biologicals, Atlanta, GA), 2X penicmin/streptomycin (Life Technologies), 10* M insulin 
(SIGMA®) 10 8 M (J-estradiol (SIGMA®), and 10" 7 M corticosterone (SIGMA®) was 
30 added to each well, and incubation continued for 72 hours. Medium was then harvested and 
centrifiiged at 1 10 x g for 5 minutes. 



35 
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6.23 Example 23: Transfection of Cultured Chicken Whole Embryo 
Fibroblasts 

To obtain whole embryo fibroblasts (WEFs), fertile chicken eggs were incubated for 
approximately 65 hours. Embryos were collected using filter paper rings, then washed three 
5 times in phosphate buffered saline with glucose (PBS-G) followed by a wash in calcium- 
and magnesium-free EDTA (CMF-EDTA). Embryos were then incubated in fresh CMF- 
EDTA at 4°C with gentle shaking for 30 minutes. CMF-EDTA was removed, and replaced 
with 0.5% trypsin solution (no EDTA) at 37°C for 3 minutes. Cells were titurated 10 times, 
then 5% chicken serum was added to inhibit the trypsin reaction. The cell suspension was 
10 then added to a-MEM (Life Technologies) supplemented with 2.2 g/1 NaHC0 3 , 2.52 g/L 
EPPS, 0.18 g/lD-glucose, 5% FBS, 5% chick serum (heat inactivated at 55°C for 1 hour), 
5xlO" 5 M p-mercaptoethanol, 0.2 mM L-glutamine, 2X penicillin/streptomycin and 
centrifuged. Cells were resuspended in a-MEM supplemented as described above, and 
plated on 6-well dishes at a density of 2 x 10 5 cells per well. 
15 For each transfection, 6 ul of FuGene 6 liposomes (ROCHE® Molecular 

Biochemicals) and 2 ug of DNA were preincubated 15 min at room temperature in 100 ul 
of OPTIMEM™, then added to the WEFs. WEFs with DNA/liposomes were incubated 5 
hours at 37 °C in 5% C0 2 . The transfection medium was then removed and replaced with 2 
ml of a-MEM supplemented as described above. Medium was removed 72 hours after 
20 transfection arid centrifuged at 1 10 x g for 5 minutes. 

WEFs were transfected either with the heavy and light immunoglobulin polypeptides 
encoded by separate plasmids (pl083 and pl086 respectively) each under the control of the 
CMV promoter or encoded on the same reactor under the transcriptional control of a CMV 
promoter and including an IRES translational element as described in U.S. Patent 
25 Application No. 09/977,374, filed 08 June 2001 and incorporated herein by reference in its 
entirety. The supematants were analyzed for antibody content by ELISA and FACs. 

6.24 Example 24: Generation of Transgenic Chickens Expressing Antibodies 

A retroviral vector, based on either avian leukosis virus (ALV) or Moloney murine 
30 leukemia virus (MoMLV), will be constructed such that the light (L) and heavy (H) chains 
of a monoclonal antibody (MAb) will be linked by an internal ribosome entry site (IRES) 
element. Both genes will then be transcriptionally regulated by a promoter such as the 
cytomegalovirus (CMV) immediate early promoter/enhancer or a promoter that 
demonstrates tissue specificity for the hen oviduct (for example, the lysozyme promoter, 
35 ovalbumin promoter, an artificial promoter construct such as MDOT, and the like). The 
promoter-L chain-IRES-H chain DNA expression cassette will be flanked by the long 
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terminal repeats (LTRs) of the retrovirus. Stage X chicken embryos will be injected with 
transducing particles containing the above construct to generate transgenic chickens. 

Alternatively, the heavy and light chains will be included in separate retroviral 
vectors and separate lines of transgenic chickens will be generated. Each line will either 
5 express the heavy or light chain of the MAb. Once germline transmission of the transgene 
is established in the two lines, they will be bred to each other to express heavy and light 
chains together to make functional MAbs in the offspring. 

The above DNA constructs can also be integrated into a chicken genome by sperm- 
mediated transgenesis (SMT). SMT may involve transfection, electroporation, or 
10 incubation of sperm with the desired DNA construct (for example, the lysozyme promoter 
controlling expression of heavy and light chains of the MAb) and fertilization of ovum with 
the treated sperm by artificial insemination or by chicken intracytoplasmic sperm injection 
(ChlCSl™). 

1 5 6.25 Example 25: Preparation of Recipient Avian Cytoplasts by TPLSM 

Incubation 

Ova were isolated from euthanized hens between 2-4 hours after opposition of the 
previous egg. Alternatively, eggs were isolated from hens whose oviducts have been 
fistulated (Gilbert & Woodgush, 1963, J.Reprod. &Fertility 5: 451-453) and (Pander et 

20 aL, 1989, Br. Poult. Sri. 30: 953-7). Before generating images of the avian early embryo, 
DNA was incubated with a specific dye according to the following protocol. 

The albumen capsule was removed and the ovum placed in a dish with the germinal 
disk facing the top. Remnants of the albumen capsule were removed from the top of the 
germinal disk. Phosphate buffered saline was added to the dish to prevent drying of the 

25 ovum. A cloning cylinder was placed around the germinal disk and 1.0/zg/ml of DAPI in 
PBS was added to the cylinder. Visualization was performed after approximately 15 
minutes of incubation. 
Injection 

Preparation of the egg was done as described for incubation. Following removal of 
30 the capsule, 10-50 nanoliters of a 0.1 Mg^l solution of DAPI in PBS was injected into the 
germinal disk using a glass pipette. Visualization was performed approximately 1 5 mmutes 

after injection. 
Visualization 

Following incubation, images of the inside of the avian early embryo were generated 
35 through the use of TPLSM. The germinal disk was placed under the microscope objective, 
and the pronuclear structures were searched within the central area of the disk, to a depth of 
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using low laser power of 3-6 milliwatts at a wavelength of 750 nm. Once the 
structures were found they were subsequently ablated. 
Nuclear Ablation and Enucleation 

Pronuclear structures were subjected to laser-mediated ablation. In these 

5 experiments, an Olympus 20x/0.5NA (Numerical Aperture) water immersion lens was used. 
The x and y planes to be ablated were defined with the two photon software, while the z 
plane (depth) was just under lOum for this type of objective. Since the pronuclear structure 
was about 20 urn in diameter, the ablation comprised two steps (2 times lOum). The focal 
point was lowered to visualize the remaining of the pronucleus, which was subsequently 

10 ablated. The laser power used to ablate the pronuclei was between 30 to 70 milliwatts at a 
wavelength of 750 nm. For the ablation experiments, the image was zoomed by a factor of 
.4 to 5, giving an area compression of 16-25 fold. Then the power was increased 10-12 fold 
for a total intensity increase of 160-300 fold compared to the visualization intensity of 3-6 
milliwatts. The ablation intensity (power density) is the functional parameter, i.e. the power 

15 increase of 10-12 fold results in ablation power of 30-70 milliwatts, but the zoom factor 
compressed this power into an area 16-25x smaller giving a power density increase of 160- 
300 fold. 



20 



6.26 Example 26: Preparation of the Nuclear Donor Cell and Isolation of the 
Donor Nucleus 



Avian fibroblast cells in culture were trypsinized (0.25% Trypsin and 1/jM EDTA), 
centrifuged twice in PBS containing 5% of fetal calf serum (FCS) and placed in a 60 mm 
plastic dish in PBS containing 5% of FCS. Using the microscope/micromanipulation unit 
described in Example 27 below, under transmission light, the nuclear donors were then 
25 isolated by repeated pipetting of the cells, which disrupted the cytoplasmic membrane and 
released the nucleus from inside the cell. 



6.27 Example 27: Preparation of the Reconstructed Zygote 
A micromanipulation unit, comprising an IM-16 rnicroinjector and a MM-188NE 
30 micromanipulator, both from NIKON®/MARISHIGE, were adapted to an upright 

NIKON® Eclipse E800. This microscope was adapted to operate under both transmission 
and reflective light conditions. This unique configuration has allowed us to 
morphologically examine and prepare (isolate the nuclei, as described above) somatic cells 
in suspension and to load the injection pipette using dry or water immersion lenses under 
35 diascopic iUumination or transmitted light This was followed by prompt localization and 
positioning of the germinal disk under the microscope and subsequent guided injection of 
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the somatic cells, using dry and long distance lenses under fiber optic as well as episcopic 
illumination (fight coming from the side and through the objectives onto the sample 
respectively). 

5 6.28 Example 28 : Production of Transgenic Chickens by Direct Pronuclear 

DNA Injection 

Production of transgenic chickens by direct DNA injection can be by two methods: 
(a) injection of a DNA directly into the germinal disk, commonly described as cytoplasmic 
injection, as described for avian species by Sang & Perry, 1989, Mol. Reprod. Dev. 1: 98- 

10 106, and Love et al, 1994, Biotechnology (N.Y.) 12: 60-3, incorporated herein by reference 
in their entireties. Sang & Perry described only episomal replication of the injected cloned 
DNA. Love et al. suggested that the injected DNA becomes integrated into the cell's 
genome. In both cases, injection was into pronuclear stage eggs. This procedure, therefore, 
is cytoplasmic injection of pronuclear stage eggs, not pronuclear injection; and (b) imaging 

15 of the egg using multiphoton microscopy to allow localization of the pronuclear structures. 
The DNA solution is then injected directly into the pronucleus. 
DNA preparation 

The plasmid pAVUCR-Al 15.93.1.2 containing the chicken lysozyme promoter 
region, and controlling expression of human interferon a2b, was purified with a QIAGEN® 
20 Plasmid Maxi Kit (QIAGEN®, Valencia, CA), and 5 ug of the plasmid DNA were 

restriction digested with the restricuon^zyme Not I A 12.7 kb fragment was purified by 
gel electrophoresis and electroeiution, phenol/chloroform extraction, and ethanol 
precipitation. The DNA was resuspended in ImM Tris-HCl, pH8.0 and O.lmM EDTA 
(0.1X TE) to a final concentration of 5pgml and then used for microinjections. 

25 Pronuclear injection 

(i) Preparation of ova. Ova were isolated from euthanized hens between two and four 
hours after opposition of the previous egg. Alternatively, eggs were isolated from hens 
whose oviducts have been fistulated as described by Gilbert & Woodgush, 1963, /. of 
Reprod. and Fertility 5: 451^53 and Pander et al, 1989, Br. Poult. Sci. 30: 953-7 and 

30 incorporated herein in their entireties. 

The albumen capsule was removed and the ovum placed in a dish with the germinal 
disk facing upwards. Remnants of the albumen capsule were removed from over the 
genninal disk. Phosphate buffered saline (PBS) was added to the dish to prevent drying of 
the ovum. A cloning cylinder could be placed around the genninal disk to reduce the 

35 depression of the ooplasmic membrane formed during subsequent pipette penetration, 
thereby facilitating the injection. 
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(ii) Injection. Between about 1-100 nanoliters of DNA solution was injected into a germinal 
disk using a glass pipette after removal of the capsule. The microinjection assembly and 
methods for microinjecting and reimplanting avian eggs are fully described in U.S. Patent 
Applications. 09/919,143, filed 31 July 2001. 
5 Briefly, the microscope/micromanipulation unit is an HM-16 micrbinjector and a 

MM-188NE micromanipulator, both from NKON®/MARISHIGE, adapted to an upright 
NIKON® Eclipse E800 microscope adapted to operate under both transmitted and reflected 
light conditions. This unique configuration allows the loading of a DNA solution into a 
micropipette while observed with a pipette dry or water immersion lenses under diascopic 
10 illumination or transmitted light. Pipette loading is followed by the prompt localization and 
positioning of the genninal disk under the microscope and subsequent guided injection of 
DNA solution into the germinal disk using dry and long working distance lenses under fiber 
optic as well as episcopic iUiimination (side mumination and directly through the objectives 
and onto the sample, respectively). 
15 (Hi) Localization of the Avian Embryo. A cloning cylinder is placed around the germinal 
disk and MlTOTRACKER® (300 nM) in PBS was added to the cylinder. Visualization is 
performed after approximately 20 minutes of incubation. Imaging using this dye shows 
intense labeling of the region around the nucleus while the nucleus itself does not take up 
the dye. This will allow localization of the pronucleus for nation while not causing 
20 excessive damage to its structure, since the content of the pronuclei are not labeled and 
therefore are bleached during imaging. Once the pronucleus is localized, the DNA solution 
can be delivered into it using a microinjector. Cytoplasmic or pronuclear injected eggs can 
then be surgically transferred to a recipient hen. 

(vv) Ovum transfer. At the time of laying, recipient hens are gas anesthetized using 
25 Isofluorine. At this time, the infundibulum is receptive to receiving a donor ovum but has 
not yet ovulated. Feathers are removed from the abdoniinal area, and the area is scrubbed 
with betadine, and rinsed with 70% ethanol. The bird is placed in a supine position and a 
surgical drape is placed over tie bird with the surgical area exposed. An incision 
approximately 2 inches long is made beginning at the junction of the sternal rib to the 
30 breastbone and running parallel to the breastbone and through the smooth muscle layers and 
the peritoneum, to locate the infundibulum. The infundibulum is externalized and opened 
using gloved hands and the donor ovum is gently applied to the open infundibulum. The 
ovum is allowed to move into the infundibulum and into the anterior magnum by gravity 
feed. The infundibulum is returned to the body cavity and the incision closed using 
35 interlocking stitches both for the smooth muscle layer and the skin. The recipient hen is 
returned to her cage and allowed to recover with free access to both feed and water. 
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Recovery time for the bird to be up, moving and feeding is usually within 45 minutes. Eggs 
laid by the recipient hens are collected the next day, set, and incubated. They will hatch 2 1 
days later. 

The procedure described by Love et ah, 1994, in Biotechnology (N.Y.) 12:60-63, 
resulted in 5.5% survival to sexual maturity using the Perry ex ovo procedure. Following 
injection and surgical transfer by the methods described herein, however, a survival rate 
between about 50% and about 70% is expected, i.e., hatching, and most of the hatched birds 
should reach maturity. 

6.29 Example 29:MuLV and VSV Viral Transfection of Avian Eggs 
Preparation ofMuLV/VSVg viral stocks. GP-293 cells at 70-80% confluence were 
transfected with 10 ug of the plasmid pVSVg or pLNHX-CMVE-MDOT-IFN. Sixty hours 
after transfection, the supernatant was collected and centrifuged at 1000 rpm for 5 nunutes 
to remove cells. The supernatant was filtered through a 0.45 micron filter and the filtrate 
15 was centrifuged at 20,000 rpm to pellet the virus. The viral pellet was resuspended in 400 
ml of STE buffer. To determine the viral titer, a 100-fold dilution of the viral stock was 
made and 5 ul of the serially diluted stock was used to infect Sentas cells. Forty-eight hours 
after infection, the cells were grown in medium containing 100 ug/ml G418. Colonies that 
were formed after two weeks in the selection medium were counted to determine the viral 
20 titer. 

Isolation ofblastodermal cells from stage X Barred Plymouth Rock (BPR) embryos. 
Freshly laid eggs were collected. The embryo at this stage consists of about 50,000-60,000 
cells in a small circular area called the blastodermal disc. The discs from about 30 embryos 
were dissected from the eggs and the cells dissociated using 1XPBS (phosphate buffer 
25 salme) (xmtaining 0.05% trypsin. The cells were centrifuged at 500 rpm for 5 minutes. The 
pellet was gently washed with 1 x PBS and pelleted again and counted using a 
hemocytometer. 

Interferon (WN) assay. Blood samples were collected from 6 wk old chicks and the 
interferon levels in the serum were measured using the hu-IFN-a ELBA Kit (PBL 

30 Biomedical Lab., New Brunswick, NT). 

1 19 WL stage X eggs were injected with 5 pi of pLNHX-MDOT-HWVSVg virus 
with a titer 6 x 10 4 /ml). 53 injected eggs survived, of which 20 hatched. Sperm samples 
were tested from the males at sexual maturity. Two males, # A 24 and A 34, showed the 
presence of the transgene and therefore were used for further breeding for testing the germ- 

35 line transmission. 
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Freshly isolated 2 x 10 5 BRD cells from stage X embryos were infected with 1.5 X 
10 4 pLNHX-MDOT-IFN/VSVg virus at 37° C for 1 hour. The cells were gently stirred 
every 10-15 minutes. While the blastodermal cells were being thus processed, 150 freshly 
laid WL (stage X) eggs were irradiated at 600 rads and set aside for the injections. A 5 ul 
5 cell suspension containing about 4000-5000 blastodermal cells were injected into each of 85 
irradiated stage X WL eggs through a hole drilled in the shell. The eggs were sealed and 
incubated to hatch. Out of 85 stage X WL eggs that were injected with the BRD cells 
infected with pLNHX-MDOT-IFN/VSVg virus, 47 survived and 15 of these hatched. The 
feather chimerism in these birds was between 5-85%. 
10 In an alternative experiment, freshly isolated 6 x 10 s BRD cells from stage X 

embryos were mixed with 4 x 1 0 5 P LNHX-CMVE-MDOT-IFN viral particle and incubated 
at 37" C for 1 hour. The cells were gently stirred every 10-15 minutes. While the 
blastodermal cells were being processed, 150 freshly laid WL (stage X) eggs were collected 
and irradiated at 600 rads and set aside for the injections. A 5 ul cell suspension containing 
15 about 4000-5000 cells was injected into each of 107 irradiated stage X WL eggs through a 
small hole drilled in the shell. The eggs were sealed and incubated to hatch. 

Out of 107 stage X WL eggs injected with the BPR cells infected with the pLNHX- 
CMVE-MDOT-IFN vims, 53 of these survived, of which 17 hatched. These birds showed 
varying degree of feather chimerism that ranged from 2-85%, as shown in Table 3 below. 

20 

Table 3: Chimera d istribution of chicks transgenic for pLNHX-CMVE-MDOT-IFN virus 

Chimerism % Black 
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Blood samples were collected from these chicks when they were 6wk old. 
interferon levels in lOOul serum sample was analyzed using the h-IFN-EUSA Kit. Results 
of the assay are shown in Fig 18. The successful detection of the transgene-encoded product 
(i.e. interferon) indicates that the BPR-injected cells were stably integrated into different . 

5 tissues and thereby demonstrating that Moloney leukemia viruses pseudotyped with VSVg 
can be used for generating transgenic birds. 

In a parallel experiment with a different MuLV/VSVg pseudotyped virus (pLNHX- 
MDOT-IFN), feather chimeric chicks that did not hatch (i.e. died during the incubation 
period) were collected. Three tissues, skin heart and lung, from these birds were analyzed 

10 for the presence of the transgene by TAQMAN® analysis. In three chicks, all three tissues 
showed the presence of the transgene. In the fourth chick, as shown in Figs. 1 9 and 20, the 
transgene was detected in two of the tissues. These results show that the injected BPR cells 
infected with Moloney viruses pseudotyped with VSVg are stably integrated into different 
tissues of the chick. 



6.30 Example 30: Construction of Lysozyme Promoter Plasmids 
The chicken lysozyme gene expression control region isolated by PCR amplification 
is fully disclosed in U.S. Patent Application No. 09/922,549, filed 03 August 2001 and 
incorporated herein by reference in its entirety. Ligation and reamplification of the 
20 fragments thereby obtained yielded a functionally contiguous nucleic acid construct 
- comprising the chicken lysozyme gene expression control region operably linked to a 
nucleic acid sequence encoding a human interferon a2b polypeptide and optimized for 
codon usage in the chicken. Briefly, chicken (Gallus gallus (White Leghorn)) genomic 
DNA was PCR amplified using the primers 5pLMAR2 and LE-6.1kbrevl in a first 
25 reaction, and Lys-6. 1 and LysElrev as primers in a second reaction. PCR cycling steps 
were: denaturation at 94° C for 1 minute; annealing at 60 °C for 1 minute; extension at 72 °C 
for 6 minutes, for 30 cycles using TAQ PLUS PRECISION™ DNA polymerase 
(STRATAGENE®, LaJolla, CA). The PCR products from these two reactions were gel 
purified, and then united in a third PCR reaction using only 5pLMAR2 and LysElrev as 
30 primers and a 10 minute extension period. The resulting DNA product was phosphorylated, 
gel-purified, and cloned into the EcoR V restriction site of the vector PBLUESOUPT® KS, 
resulting in the plasmid pl2.0-lys. 

pl2.0-lys was used as a template in a PCR reaction with primers 5pLMAR2 and 
LYSBSU and a 10 minute extension time. The resulting DNA was phosphorylated, gel- 



35 purified, and cloned into the EcoR V restriction site of pBluescript® KS, forming plasmid 
pl2.01ys-B. 
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p!2.01ys-B was restriction digested with iVb/ 1 and Bsu36 I, gel-purified, and cloned 
into Not I and Bsu36 I digested pCMV-LysSFIFNMM, resulting in pI2.0-iys-LSPIFiNMM. 
pl2.0-lys-LSPIFNMIVl was digested with Sal I and the SalltoNotI primer was annealed to 
the digested plasmid, followed by Not I digestion. The resulting 12.5 kb Not I fragment, 
comprising the lysozyme promoter region linked to IFNMAGMAX-encoding region and an 
SV40 polyadenylation signal sequence, was gel-purified and ligated to Not I cleaved and 
dephosphorylated PBLUESCRIPT® KS, thereby forming the plasmid pAVUCR-A115.93.L2. 

6.31 Example 31 : Complete Lysozyme Promoter and IFNMAGMAX 
Sequences 



The complete sequences of the lysozyme gene promoter and the co don-optimized 
human interferon a2b nucleic acid are fully disclosed in U.S. Patent Application No. 
09/922,549, filed 03 August 2001 and incorporated herein by reference in its entirety. The 
complete nucleotide sequence of the approximately 12.5 kb chicken lysozyme promoter 

1 5 region/IFNMAGMAX construct spans the 5 ' matrix attachment region (5 ' MAR), through 
the lysozyme signal peptide, to the sequence encoding the gene IFNMAGMAX and the 
subsequent polyadenylation signal sequence. The IFNMAGMAX nucleic acid sequence 
had been synthesized as described in Example 17 above. The expressed IFN a2b sequence 
within plasmid pAVUCR-A 115.93.1.2 functioned as a reporter gene for lysozyme promoter 

20 activity. This plasmid construct may also be used for production of interferon a2b in the 
egg white of transgenic chickens. 

632 Example 32: Expression in Transfected Cultured Avian Oviduct Cells of 
Human Interferon <x2b Regulated by the 12kb Lysozyme Promoter 

25 The oviduct was removed from a Japanese quail (Cotumix cotumix japonica) and 

the oviduct cells transfected with the lysozyme promoter-IFNMAGMAX as described in 
Example 21, above. The supernatant was analyzed by ELISA for human interferon a2b 
content 

The human interferon a2b contents of medium derive*! from cultured oviduct cells 
30 transfected with eitherpAWC^-A115.93.1.2 orthenegative control plasm 

EGFP, as shown in Fig. 16. Bars to the right of the figure represent the standards for the 
IFN ELISA 



35 
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6.33 Example 33: Production of Heterologous GM-CSF in Serum of 
Transgenic Chickens 

Seventy-three birds were injected with CMV-GMCSF (ALV) wherein a nucleic acid 
encoding GM-CSF was functionally linked to the cytomegalovirus promoter. All were 
5 subsequently tested. Three control birds that had nothing injected were also included. For 
each bird tested, approximately lOOul of blood was collected with heparinized tubes then 
diluted into lOOjal of PBS solution and spun to remove red blood cells. lOOjxl of the plasma 
was then assayed. 

As shown in Table 2 (below), three of the experimental birds had GM-CSF plasma 
10 levels that were higher than the highest available standard of 500 pg/ml used in the ELISAs. 
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When the dilution is factored in, three birds had greater than approximately 1 ng/ml. 
Eleven additional birds had GM-CSF levels within the range detectable by ELISA, from 26 
pg/ml to 182 pg/ml (with the dilution factored in). Control birds S8484, S8507 and S8508 
lO were negative. 

6.34 Example 34: Synthesis of the MDOT promoter construct 
Amplification of the ovomucoid and ovotransferrin promoter sequences 

Oligonucleotide primers 1 (SEQ ID NO: 38) and 2 (SEQ ID NO: 39), as shown in 

15 Fig. 22 were used to amplify the ovomucoid sequences. Oligonucleotide primers 3 (SEQ ID 
NO: 40) and 4 (SEQ ID NO: 41) were used to amplify the ovotransferrin sequence by PCR. 
The primers were designed such that the PCR-amplified ovomucoid sequences contained an 
Xho I restriction cleavage site at the 5' end and a Cla I site at the 3' end. Similarly, the 
PCR-amplified ovotransferrin product had a Cla I restriction site at the 5 ' end and a Hind JR 

20 site at the 3 ' end. The overlapping Cla I site was. used to splice the two-PCR products to 
create the MDOT promoter construct The nucleic acid sequence SEQ ID NO: 1 1 of the 
MDOT promoter construct is shown inFig. 14. The final product was cloned in a biuescript 
vector between the Xho I and Hind EE sites. From the biuescript vector the promoter region 
was released by Kpn VHTind Wl restriction digestion and cloned into the prc-CMV-IFN 

25 vector to replace the CMV promoter to create MDOT-IFN (clone #10). This plasmid was 
tested in vitro. 

Interferon synthesis directed by the MDOT promoter in transfected oviduct cells. 

The promoter activity was tested in vitro by transfecting the plasmid construct into 
tubular gland cells isolated from the quail oviduct. The transfected cells were treated with 
30 hormones (progesterone, estrogen and insulin). At 72 hrs after transfection, the supernatant 
media of the transfected cells were collected and the interferon levels analyzed using an 
ELBA assay. The results, as shown in Fig. 23 show a significant induction of interferon 
a2b expression in hormonally treated cells. 



35 



-99- 



WO 03/025146 



m» -sr 



««*»•* lilM* .«»••! 



jj U itllS on* >'' 
p «uU iu- ./ PCT/US02/29878 



i?vnmnle35: Production of Erythropoietin in the Serum of Transgenic 
Chickens 

Sixty birds were injected with a nucleic acid construct comprising a nucleic acid 
region encoding erythropoietin (EPO) 3 1 of, and operably linked to, the MDOT artificial 
5 promoter in the ALV vector (MDOT-EPO (ALV)) described in Example 34, above. All 
birds were subsequently tested. Two control birds that had nothing injected were also 
tested. Approximately lOOul of blood from each bird was diluted into lOOul of PBS/EDTA 
solution and spun to remove red blood cells. lOOul of the plasma was then assayed. 

As shown in Table 4 below, twenty-three of the experimental birds had EPO levels 
10 in their plasma higher than the highest available ELISA standard of 1540 pg/ml. 
Table 4: Production of erythropietin under the control of promoter MDOT 

EGG EGG EGG 

ELISA Taqman® ELISA ELISA ELISA 
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15 . 

When the dilution is factored in, 23 birds have greater than approximately 3080 
pg/ml. An additional 27 birds had EPO levels within the range detectable by ELISA, from 
58 pg/ml to 2450 pg/ml (with the dilution factored in). Control birds were negative. 
Although preferred embodiments of the invention have been described using 

20 specific terms, devices, and methods, such description is for illustrative purposes only. : The 
words used are words of description rather than of limitation. It is to be understood that • 
changes and variations maybe made by those of ordinary skill in the art-without departing . 
from the spirit or the scope of the present invention, which is set forth in the following 
c laims In addition, it should be understood that aspects of the various embodiments may 

25 be interchanged both in whole or in part 

All references cited herein are incorporated herein by reference in their entirety and 
for all purposes to the same extent as if each individual publication, patent or patent 
application was specifically and individually indicated to be incorporated by reference in its 

3Q entirety for all purposes. 

The citation of any publication is for its disclosure prior to the filing date and should 
not be construed as an admission that the present invention is not entitled to antedate such 
publication by virtue of prior invention. 

Many modifications and variations of this invention can be made without departing 
35 from its spirit and scope, as will be apparent to those skilled in the art. The specific 

embodiments described herein are offered by way of example only, and the invention is to 
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be limited only by the terms of the appended claims along with the full scope of equivalents 
to which such claims are entitled. 
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What is claimed is: 

1 . A method of producing a transgenic avian, said method comprising: 

(a) microinjecting into a cell of an avian embryo a nucleic acid 
comprising a transgene comprising a nucleotide sequence encoding a 

5 heterologous polypeptide; 

(b) introducing the microinjected avian embryo into an oviduct of a 
recipient hen, such that the recipient hen lays a shelled egg containing 
the microinjected avian embryo; and 

(c) incubating the shelled egg containing the microinjected avian embryo 
10 until said shelled egg hatches, 

thereby producing a transgenic avian containing the transgene. 

2. The method of Claim 1 , wherein the avian embryo is an early stage embryo 
having a germinal disk and said nucleic acid is microinjected into said germinal disk. 

15 

3. The method of Claim 2, wherein the early stage embryo is a stage I embryo. 

4. The method of Claim 1 , wherein the heterologous polypeptide is expressed 
in one or more cells of said transgenic avian. 

20 

5. The method of Claim 4, wherein the heterologous polypeptide is expressed 
in the serum of said transgenic avian. 

6. The method of Claim 4, wherein the heterologous polypeptide is expressed 
25 in the magnum of said transgenic avian. 

7. The method of Claim 1 further comprising the step of allowing the 
transgenic avian to develop to sexual maturity. 

30 8. The method of Claim 7, wherein the heterologous polypeptide is delivered to 

the white of a developing avian egg produced by the transgenic avian. 

9. The method of Claim 1, wherein the avian is a chicken. 
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10. The method of Claim 1 , wherein the nucleotide sequence encoding said 
heterologous polypeptide is operably linked to a transcriptional regulatory element that can 
direct gene expression in one or more cells of said transgenic avian. 

5 1 1 . The method of Claim 10, wherein the transcriptional regulatory element is 

selected from the group consisting of the promoter regions of the avian genes encoding 
ovalbumin, lysozyme, ovomucoid, ovomucin, conalbumin and ovotransferrin. 

12. The method of Claim 1 1 , wherein the selected nucleic acid further comprises 
10 a chicken lysozyme gene expression controlling region comprising the nucleic acid 

sequence SEQ ID NO: 7. 

13. The method of Claim 10, wherein the transcriptional regulatory element is a 
tissue specific promoter. 

15 14 . The method of Claim 13, wherein the tissue specific promoter is specific for 

the magnum. 

15. The method of Claim-l, therein the transgene comprises at least one 
20 cytomegalovirus promoter. 

16. The method of Claim 10, wherein the transcriptional regulatory element 
comprises at least two regions derived from the promoter of an avian gene, said regions 
being from a different promoter. 

17. The method of Claim 16, wherein the transcriptional regulatory element has 
nucleotide sequence comprising the sequence SEQ HO NO: 11. 



a 



18. The method of Claim 1, wherein the transgene comprises at least one matrix 
30 attachment region (MAR). 

19. The method of Claim 18, wherein the transgene comprises a 5' MAR and a 3 1 
MAR which flank said nucleotide sequence. 

35 20. The method of Claim 1, wherein the nucleic acid is combined with a nuclear 

localization signal (MLS) peptide prior to said microinjection. 
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21 . The method of Claim 1, wherein the nucleotide sequence encoding a 
heterologous polypeptide is optimized for codon usage by an avian. 

22. The method of Claim 1, wherein the nucleotide sequence encoding a 
5 polypeptide is optimized for codon usage by a chicken. 

23. The method of Claim 1, wherein the heterologous polypeptide is selected 
from the group consisting of a cytokine, a hormone, an enzyme, a structural polypeptide and 
an immunoglobulin polypeptide. 

10 

24. The method of Claim 23, wherein the cytokine is selected mom the group 
consisting of interferon, mterleukin, granulocyte colony-stimulating factor, granulocyte- 
macrophage colony-stimulating factor, stem cell factor, erythropoietin, thrombopoietin and 
stem cell factor. 

15 

25. The method of Claim 23, wherein the cytokine is an interferon. 



26. The method of Claim 1, wherein the transgene comprises an internal 
ribosome entry site (IRES). 

20 

27. The method of Claim 26, wherein the transgene comprises at least two 
nucleotide sequences each encoding a heterologous polypeptide. 

28. The method of Claim 27, wherein the at least two nucleotide sequences 
25 encode at least two heterologous peptides that form a multimeric protein. 

The method of Claim 28, wherein the multimeric protein specifically binds a 

The method of Claim 29, wherein the multimeric protein is an antibody. 

3 1 . The method of Claim 1 , wherein the heterologous polypeptide comprises a 
peptide region suitable for the isolation of the heterologous polypeptide. 

32. The method of Claim 1, wherein the nucleic acid is a eukaryotic viral vector. 



29. 
selected ligi 

30 30. 
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33. The method of Claim 32, wherein the eukaryotic viral vector is derived from 
any of the group consisting of avian leukosis virus, adenovirus, transferrin-polylysine 
enhanced adenoviral vectors, human immunodeficiency virus vectors, lentiviral vectors, and 
Moloney murine leukemia virus-derived vectors. 

5 

34. The method of Claim 1 , wherein the nucleic acid is a plasmid vector. 

35. The method of Claim 1 , wherein the nucleic acid is a bacterial artificial 
chromosome (BAC). 

10 

36. The method of Claim 1 , wherein the nucleic acid is not a eukaryotic viral 

vector. 

37. The method of Claim 1 0, wherein the transcriptional regulatory element is a 
15 regulatable promoter. 

38. The method of Claim 12, wherein the selected nucleic acid further comprises 
a region encoding the 3' region of the chicken lysozyme gene and having the nucleotide 
sequence SEQ ID NO: 9. 

20 

39. The method of Claim 4 or 8 further comprising isolating said heterologous 
peptide from said transgenic avian or an egg laid by said transgenic avian. 

40. A transgenic avian that produces at least one heterologous polypeptide hi egg 
25 white, wherein the transgenic avian or founder ancestor of said transgenic avian was not 

produced using a eukaryotic viral vector. 

41. A transgenic avian produced by the method of Claim 1 . 



30 



42. The transgenic avian of Claim 40 or 41, wherein the avian is a chicken. 



43. The transgenic avian of Claim 42, wherein the heterologous polypeptide is 
selected from the group consisting of a cytokine, a hormone, an enzyme, a structural 
protein, and an immunoglobulin polypeptide. 

44. The transgenic avian of Claim 42, wherein the cytokine is an interferon. 
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45. The transgenic avian of Claim 40 or 41 , wherein the transgenic avian 

produces a heterologous multimeric protein. 

- 

46. The transgenic avian of Claim 45, wherein the heterologous multimeric 
5 protein specifically binds a selected ligand. 

47. The transgenic avian of Claim 45, wherein the heterologous multimeric 
protein is an antibody. 



10 



48. An avian egg produced by the transgenic avian of Claim 40 or 41 . 

49. An avian egg produced by the transgenic avian of any of Claims 42-47. 

50. A heterologous protein produced by the transgenic avian of Claim 40 or 41, 
15 wherein the heterologous protein comprises a heterologous polypeptide selected from the 

group consisting of a cytokine, a hormone, an enzyme, a structural protein, and an 
immunoglobulin polypeptide. 

5 1 . The heterologous polypeptide of Claim 50, wherein the rytokine is an 
20 interferon. 

52. The heterologous protein of Claim 50, wherein the heterologous protein is a 
multimeric protein. 

25 53 . The heterologous protein of Claim 50, wherein the heterologous protein is an 

antibody. 
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SEQ ID NO: 6 

TGCCGCCTTC TTTGATATTC ACTCTGTTGT ATTTCATCTC TTOTGCCGA TCAW3GATA 60 

is Tsssssi sssg EE sS as 

CAGCAGGTGG TGGTTGGGGT GGTGGCAGCT CA^TGACAGG A^GGTTTTT ™CTGTU *w 

SB5S5S 5SSSS5 « = ™ S3 3 

S S ESSSt ™ IS = s 

TAGCTGTTGT TACACGAGAT ACCTTATTAA GTTTAGGCCA GCTTGATGCT TWmr I lit b4U 
OTTreSfir AGTGAGCGTT CTCTGGTTTT TTTCCTTO AACTGGTGAG GCTTAGAm 00 
TTCTAATGGG ATTTTTTACC TGATGATCTA GTTGCATACC CAAAIhLUb ^ 
CCTAGTTMC ATGTTGATAA CTTCGGATTT ACATCHGTA TATACTTGTC £™MW 

s ssssss is Ss sH «s s 

S SS £2S «g |ggg TTT^CW » 

ssss sgg = ~ ja 

TTTTGGCATA GGAAGCAAGC TGCATGGATT TGTTTGGGAC ™VUIAU -[ ATGCAGAAG 1200 
AGTGCATAGG TTTTAAACAC AGTTGCAGCA TGCTMCGAG TCACAbLbl I >^ 
TGATGCCTGG ATGCCTGTTG CAGCTGTTTA CGGCACTGCC TJGCAGTBAb ^< UZQ 
AGGGGTGGGG TGCTTTGTGT CCTGTTCCCA CACGCTGCCA CALAhLL^u £ 138Q 
CATCTCACCT GCTGGGTACT TTTCAAACCA TCTTAGCAGT A™AJWb mQ 
ACAGAGAAGT TCCTCAGTTG GATATTCTCA TGGGATGTCT TTTHCLUU ^ 1500 
GTATGATAAA GCATCTCTAT TTGTMATTA TGaCTTGTT AGTTCCTGAA ^ 
AGCACCACTT ATTGCAGCAG GTGTAGGCTC TGGTGTGGCC TGTCTCTb lb ^ 1620 
TTAAAGCTTC mGGAAATA CACTGACTTG ATTGAAGTCT. ™™BM ™™ 168Q 
CTTACCTTTG ATCCCAATGA AATCGAGCAT ™GTTGTA AAAbAAj^ mo 
ACCATGTAAT GTAATTTTAC ACCCCCAGTG CTGACACTTT .^™IAU ^ 18Q0 
ACTTTGGCCT CACCCTCTTG TGTACTGW TTTGTAATAG AAAAjAim ^ 186Q 
TATGATTATT ACATTATGAA AGAGACATTC TGCTGATCTT CAAP lib lAAb ig20 
GTGCGTGTGC TTTTATAAAT ACMGTGATT GCAMTTAGT ^GTCC ™AAMAA 1^ 
AAAAAAAAAG TAATATAAAA AGGACCAGGT STTTTACMG JbAAftiAuai ' ^ 2Q40 

TAAACAGTTA CATTT1TATG AAGATTACCA ^GCTGA CTTTCTAAAC ATAAGGUbl i 
ATTGTCTTCC TGTACCATTG; CATTTCCTCA ^CCCMTTT GCACAAGGA I ^ zm 
ACTATTCAAG AAATGGCTTT GAAATACAGC ATGGGAGCTT |£ I AAAGGATTTT 2220 

TTGCACTGCA AAATGTCAGG AWGGATGT CTCTCAGAAT SCCCAAULL ^ 
ATATGTGTAT ATAGTAAGCA GTTTCCTGAT TCCAGCAGGC CAAAGAfa IU mQ 
GTGTTGCCGG AGACCTGTAT TTCTCAACAA GGTAAGATGG ™CLLAbL* ^ 2400 
TTAATACATT TTCAGCAGAA GTACTTAGTT AATCTCTACC TTTAGGGATC 
TTTTAGATGT TATACTTGAA ATACTGCATA ACTTTTAGCT TTAAACTGCA 2520 

sssas ssf SI Ss S s nts 

SS IS SS 2700 

FIG .1 A 
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GTTATTATTT CTGAAACTAG AAATGATGTT GTCTTCATCT GCTCATCAM CACTTCATGC 2760 
AGAGTGTAAG GCTAGTGAGA AATGCATACA TTTATTGATA CTTTTTTAAA GTCAACTTTT 2820 
TATCAGATTT TTTTTTCATT TGGAAATATA TTGTTTTCTA GACTGCATAG CTTCTGAATC 2880 
tcEwEE TCTGATTGGC ATGAAGAAGC ACAGCACTCT TCATCTTACT TAAACTTCAT 294 
TTTGGAATGA AGGAAGTTAA GCAAGGGCAC AGGTCCATGA AATAGAGACA GTGCGCTCAG 3000 
GAGAAAGTGA ACCTGGATTT CTTTGGCTAG TGTTCTAAAT CTGTAGTGAG GAMGTAACA 3060 
rfTGATTCCT TGAAAGGGCT CCAGCTTTAA TGCTTCCAAA TTGAAGGTGG CAGGCAACTT 3120 
GGCCACTGGT TATTTACTCC ATTATGTCTC AGTTTCGCAG CTAACCTGGC TTCTCCACTA 3180 
TTGAGCATGG ACTATAGCCT GGCTTCAGAG GCCAGGTGAA GGTTGGGATG GGTGGAAGGA 3240 

StcGGG GGACTGTGGG GACTCCAAGC T^GCTTGGG GTGGGCAGCA 330 
CAGGGAAAAG TGTGGGTAAC TATTTTTAAG TACTGTGTTG CAAACGTCTC ATCTGCAAAT 3360 
ACGTAGGGTG TGTACTCTCG AAGATTAACA GTGTGGGTTC AGTMTATAT GGATGAATTC 3420 
ACAGTGGAAG CATTCAAGGG TAGATCATCT MCGACACCA GATCATCAAG CTATGATTGG 3480 
AAGCGGTATC AGAAGAGCGA GGAAGGTAAG CAGTCTTCAT ATGTTTTCCC TCCACGTMA 3540 
GCAGTCTGGG AAAGTAGCAC CCCTTGAGCA GAGACAAGGA AATAATTCAG GAGCATGTGC 3600 
TAGGAGAACT TTCTTGCTGA ATTCTACTTG CAAGAGCTTT GATGCCTGGC ™TGGTGCC 3660 
TTfTRCAGCA CCTGCAAGGC CCAGAGCCTG TGGTGAGCTG GAGGGAAAGA TTCTGCTCAA 3720 
GTCCAAGCTT CAGCAGGTCA TTGTCTTTGC TTCTTCCCCC AGCACTGTGC AGCAGAGTGG 3780 
AACTGATGTC GAAGCCTCCT GTCCACTACC TGTTGCTGCA GGCAGACTGC TCTCAGAAAA 3840 
ATAGAGCTAA CTCTATGCCA TAGTCTGAAG GTAAAATGGG TTTTAAAAAA GAAAACACAA 3900 
S SttA SgAAA GCAGTGGTAA A^TGGTAGA AAAGGTGCAG 3960 
AAGCCCCCAG GCAGTGTGAC AGGCCCCTCC TGCCACCTAG AGGCGGGAAC AAGCTTCCCT 4020 
GrTTAGGGCT CTGCCCGCGA AGTGCGTGTT TCTTTGGTGG GTTTTGTTTG GCGTTTGGTT 4080 
SrAGATTTA GACACAAGGG MGCCTGAAA GGAGGTGTTG GGCACTATTT TGGTTTGTAA 4140 
SctSJct Sw A^GTGAG GGAGTGTAGC GAATWCA ATTTAAAATA 4200 
AAGTTGCAAG AGATTGAAGG CTGAGTAGTT ^GAGGGTAA CACGTTTAAT ^TCTCT 4260 
rAAAPTArTR rTTTTAAACA CTTGTTTGAG TGGTGAGACC TTGGATAGGT GAGTGCTCTT 4,3<:u 
™StcTC T6ATGCACTT GCTTGTCCTT 1TCCATCCAC ATCCATGCAT TCCACATCCA 4380 
SatTTCTC ACTTATCCCA SgTCAT ATCTGACATA CCTGTCTCTT CGTCACTTGG 4440 
TCAG^AGAAA CAGATGTGAT AATOXCAGC CGCCCCAAGT TTGAGAAGAT GGCAGTTGCT 4500 
TrTTTTfTTT TTTCCTGCTA AGTAAGGATT TTCTCCTGGC TTTGACACCT CACGAAATAG 4560 
tcSctcS TTACATTCTG GGC ATTATTT CAAATATCTT TGGAGTGCGC TGCTCTCAAG 4620 
TCTTCCTGCC TTACAT lUb TCTTAGAGTG AAAGAGAAGG AAGAGAAGAT 4680 

SS SS ™c*t Jtgmwtg ^caaaggtg z^mc mo 

TCTGAGGAAC GGGCAGCGTT TGCCTCTGAA AGCAAGGAGG TCTGCGGAGT ^CACTTATT 4800 
" TTGCAACTGA TGGTGGAACT GGTGCTTAAA GCAGATTCCC TAGGTTCCCT GCTACTTCTT 4860 
TTfCTTCTTG gSgTCAGTT WTTCTGAC AGACAAACAG CCACCCCCAC TGCAGGCTTA 4920 
S SS SSSsm CAGCTCTGCC CWOAAAG G^AmAAA 4980 
CGGGCACCAT TCATCCCAAA CAGGATCCTC ATTCATGGAT ^TCTM GGMCTTCGG 5040 
nrrwcnc. AAAACATTAA TTGGAGTACG AATGTAATrA AAACTGCATT CTCGCATTCC 5100 

Swum? JgSctbSct SSagcatg taktcggca gctcccaot tctcammc 5 

rAnrATGGA GGAGTAGTAA AAATGGAGAC CGATTCAGAA CAACCAACGG AGTGTTGCCG 522U 
KlK TbSSSSt Sg TGT^GGAC ATTimTTTA MTACATAAA 5280 
CTACTTCAAA TGAGGTCGGA GAAGGTCAGT GTTTTATTAG CAGCCATAAA ACCAGGTGAG 5340 
SScaS TTTTCTCTAC AAGAAAAACG ATTCTGAGCT CTGCGTAAGT ATAAGTTCTC 5400 
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rATAcrrrrr r^mCCCC CCTGGCTGCC TGCCATCTCA GCTGGAGTGC AGTGCCATTT 5460 

TCTCTCACAG CAGTAATGGG ACAATACTTC ACAAAAATTC TTTCTTTTCC 5520 
S GATCCCTACT GGTTTTACGT T£CCCCTGA 

GGTTTTTAAT SS S S 5700 

CCAGCATTTT 6GTTTTTAA I IAI^i^mmim -rrAArTrTTT TATTTAGAGA 5760 

ssssss = ™ sasss ss ss 



JSSSSSS S Kg ™j ™ ffl ss 

— ~ SS S iS Ss » 

SI S SEI aaas sssss s 

SSSSS JfflS SSgg « ™, ™c s 

iilliii 
sillisl 

MM 

TTGTCCTCCA TCCTTTCCCA GCTTGTATCT TTGACAAATA CAgCCWTT TTgTb^ ^ 

llillsl 

S» S SS SS GATTCCTTTT TOTAOTC 8040 
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TCTTCAGCCA ATAAACAAAA TGTGAGAAGC CCAAACAAGA ACTTGTGGGG CAGGCTGCCA 8100 
TCAAGGGAGA GACAGCTGAA GGGTTGTGTA GCTCAATAGA ATTAAGAAAT AATAAAGCTG 8160 
TGTCAGACAG TTTTGCCTGA TTTATACAGG CACGCCCCAA GCCAGAGAGG CTGTCTGCCA 8220 
AGGCCACCTT GCAGTCCTTG GTTTGTAAGA TAAGTCATAG GTAACTTTTC TGGTGAATTG 8280 
CGTGGAGAAT CATGATGGCA GTTCTTGCTG TTTACTATGG TAAGATGCTA AAATAGGAGA 8340 
CAGCAAAGTA ACACTTGCTG CTGTAGGTGC TCTGCTATCC AGACAGCGAT GGCACTCGCA 8400 
CACCAAGATG AGGGATGCTC CCAGCTGACG GATGCTGGGG CAGTMCAGT GGGTCCCATG 8460 
CTGCCTGCTC ATTAGCATCA CCTCAGCCCT CACCAGCCCA TCAGAAGGAT CATCCCAAGC 8520 
TGAGGAAAGT TGCTCATCTT CTTCACATCA TCAAACCTTT GGCCTGACTG ATGCCTCCCG 8580 
GATGCTTAAA TGTGGTCACT GACATCTTTA TTTTTCTATG ATTTCAAGTC AGAACCTCCG 8640 
GATCAGGAGG GAACACATAG TGGGAATGTA CCCTCAGCTC CAAGGCCAGA TCTTCCTTCA 8700 
ATGATCATGC ATGCTACTTA GGAAGGTGTG TGTGTGTGAA TGTAGAATTG CCTTTGTTAT 8760 
TTTTTCTTCC TGCTGTCAGG AACATTTTGA ATACCAGAGA AAAAGAAAAG TGCTCTTCTT 8820 
GGCATGGGAG GAGTTGTCAC ACTTGCAAAA TAAAGGATGC AGTCCCAAAT GTTCATAATC 8880 
TCAGGGTCTG AAGGAGGATC AGAAACTGTG TATACAATTT CAGGCTTCTC TGAATGCAGC 8940 
TTTTGAAAGC TGTTCCTGGC CGAGGCAGTA CTAGTCAGAA CCCTGGGAAA CAGGAACAAA 9000 
TGTCTTCAAG GTGCAGCAGG AGGAAACACC TTGCCCATCA TGAAAGTGAA TAACCACTGC 9060 
CGCTGAAGGA ATCCAGCTCC TGTTTGAGCA GGTGCTGCAC ACTCCCACAC TGAAACAACA 9120 
GTTCATTTTT ATAGGACTTC CAGGAAGGAT CTTCTTCTTA AGCTTCTTAA TTATGGTACA 9180 
TCTCCAGTTG GCAGATGACT ATGACTACTG ACAGGAGAAT GAGGAACTAG CTGGGAATAT 9240 
TTCTGTTTGA CCACCATGGA GTCACCCATT TCTTTACTGG TATTTGGAAA TAATAATTCT 9300 
GAATTGCAAA GCAGGAGTTA GCGAAGATCT TCATTTCTTC CATGTTGGTG ACAGCACAGT 9360 
TCTGGCTATG AAAGTCTGCT TACAAGGAAG AGGATAAAAA TCATAGGGAT AATAAATCTA 9420 
AGTTTGAAGA CAATGAGGTT TTAGCTGCAT TTGACATGAA GAAATTGAGA CCTCTACTGG 9480 
ATAGCTATGG TATTTACGTG TCTnTTGCT TAGTTACTTA TTGACCCCAG CTGAGGTCAA 9540 
GTATGAACTC AGGTCTCTCG GGCTACTGGC ATGGATTGAT TACATACAAC TGTAATTTTA 9600 
GCAGTGATTT AGGGTTTATG AGTACTTTTG CAGTAAATCA TAGGGTTAGT AATGTTAATC 9660 
TCAGGGAAAA AAAAAAAAAG CCMCCCTGA CAGACATCCC AGCTCAGGTG GAAATCAAGG 9720 
ATCACAGCTC AGTGCGGTCC CAGAGAACAC AGGGACTCTT CTCTTAGGAC CTTTATGTAC 9780 
AGGGCCTCAA GATAACTGAT GTTAGTCAGA AGACTTTCCA TTCTGGCCAC AGTTCAGCTG 9840 
AGGCAATCCT GGAATTTTCT CTCCGCTGCA CAGTTCCAGT CATCCCAGTT TGTACAGTTC 9900 
TGGCACTTTT TGGGTCAGGC CGTGATCCAA GGAGCAGAAG TTCCAGCTAT GGTCAGGGAG 9960 
TGCCTGACCG TCCCAACTCA CTGCACTCAA ACAAAGGCGA AACCACAAGA GTGGCTTTTG 10020 
TTGAAATTGC AGTGTGGCCC AGAGGGGCTG CACCAGTACT GGATTGACCA CGAGGCAACA 10080 
TTAATCCTCA GCAAGTGCAA TTTGCAGCCA TTAAATTGAA CTAACTGATA CTACAATGCA 10140 
ATCAGTATCA ACAAGTGGTT TGGCTTGGAA GATGGAGTCT AGGGGCTCTA CAGGAGTAGC 10200 
TACTCTCTAA TGGAGTTGCA TTTTGAAGCA GGACACTGTG AAAAGCTGGC CTCCTAAAGA 10260 
GGCTGCTAAA CATTAGGGTC MTTTTCGAG TGCACTTTCT GAAGTGTCTG CAGTTCCCCA 10320 
TGCAAAGCTG CCCAAACATA GCACTTCCAA TTGAATACAA TTATATGCAG GCGTACTGCT 10380 
TCTTGCCAGC ACTGTCCTTC TCAAATGAAC TCAACAAACA ATTTCAAAGT CTAGTAGAAA 10440 
GTAACAAGCT TTGAATGTCA TTAAAAAGTA TATCTGCTTT CAGTAGTTCA GCTTATTTAT 10500 
GCCCACTAGA AACATCTTGT ACAAGCTGAA CACTGGGGCT CCAGATTAGT GGTAAAACCT 10560 
ACTTTATACA ATCATAGAAT CATAGAATGG CCTGGGTTGG AAGGGACCCC AAGGATCATG 10620 
MGATCCAAC ACCCCCGCCA CAGGCAGGGC CACCAACCTC CAGATCTGGT ACTAGACCAG 10680 
GCAGCCCAGG GCTCCATCCA ACCTGGCCAT GAACACCTCC AGGGATGGAG CATCCACAAC 10740 
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CTCTCTGGGC AGCCTGTGCC AGCACCTCAC CACCCTCTCT GTGAAGAACT TTTCCCTGAC 10800 
ATCCMTCTA AGCCTTCCCT CCTTGAGGTT AGATCCACTC CCCCTTGTGC TATCACTGTC 10860 
TACTCTTGTA AAAAGTTGAT TCTCCTCCTT TTTGGAAGGT TGCAATGAGG TCTCCTTGCA 10920 
GCCTTCTTCT CTTCTGCAGG ATGAACAAGC CCAGCTCCCT CAGCCTGTCT TTATAGGAGA 10980 
GGTGCTCCAG CCCTCTGATC ATCTTTGTGG CCCTCCTCTG GACCCGCTCC AAGAGCTCCA 11040 
CATCTTTCCT GTACTGGGGG CCCCAGGCCT GAATGCAGTA CTCCAGATGG GGCCTCAAAA 11100 
GAGCAGAGTA AAGAGGGACA ATCACCTTCC TCACCCTGCT GGCCAGCCCT CTTCTGATGG 11160 
AGCCCTGGAT ACAACTGGCT TTCTGAGCTG CAACTTCTCC TTATCAGTTC CACTATTAAA 11220 
ACAGGAACAA TACAACAGGT GCTGATGGCC AGTGCAGAGT TTTTCACACT TCTTCATTTC 11280 
GGTAGATCTT AGATGAGGAA CGTTGAAGTT GTGCTTCTGC GTGTGCTTCT TCCTCCTCAA 11340 
ATACTCCTGC CTGATACCTC ACCCCACCTG CCACTGAATG GCTCCATGGC CCCCTGCAGC 11400 
CAGGGCCCTG ATGAACCCGG CACTGCTTCA GATGCTGTTT AATAGCACAG TATGACCAAG 11460 
TTGCACCTAT GAATACACAA ACAATGTGTT GCATCCTTCA GCACTTGAGA AGAAGAGCCA 11520 
AATTTGCATT GTCAGGAAAT GGTTTAGTAA TTCTGCCAAT TAAAACTTGT TTATCTACCA 11580 
TGGCTGTTTT TATGGCTGTT AGTAGTGGTA CACTGATGAT GAACAATGGC TATGCAGTAA 11640 
AATCAAGACT GTAGATATTG CAACAGACTA TAAAATTCCT CTGTGGCTTA GCCAATGTGG 11700 
TACTTCCCAC ATTGTATAAG AAATTTGGCA AGTTTAGAGC AATGTTTGAA GTGTTGGGAA 11760 
ATTTCTGTAT ACTCAAGAGG GCGTTTTTGA CAACTGTAGA ACAGAGGAAT CAAAAGGGGG 11820 
TGGGAGGAAG TTAAAAGAAG AGGCAGGTGC AAGAGAGCTT GCAGTCCCGC TGTGTGTACG 11880 
ACACTGGCAA CATGAGGTCT TTGCTAATCT TGGTGCTTTG CTTCCTGCCC CTGGCTGCCT 11940 
TAGGGTGCGA TCTGCCTCAG ACCCACAGCC TGGGCAGCAG GAGGACCCTG ATGCTGCTGG 12000 
CTCAGATGAG GAGMTCAGC CTGTTTAGCT GCCTGAAGGA TAGGCACGAT TTTGGCTTTC 12060 
CTCAAGAGGA GTTTGGCAAC CAGTTTCAGA AGGCTGAGAC CATCCCTGTG CTGCACGAGA 12120 
TGATCCAGCA GATCTTTAAC CTGTTTAGCA CCAAGGATAG CAGCGCTGCT TGGGATGAGA 12180 
CCCTGCTGGA TAAGTTTTAC ACCGAGCTGT ACCAGCAGCT GAACGATCTG GAGGCTTGCG 12240 
TGATCCAGGG CGTGGGCGTG ACCGAGACCC CTCTGATGAA GGAGGATAGC ATCCTGGCTG 12300 
TGAGGAAGTA CTTTCAGAGG ATCACCCTGT ACCTGAAGGA GAAGAAGTAC AGCCCCTGCG 12360 
CTTGGGAAGT CGTGAGGGCT GAGATCATGA GGAGCTTTAG CCTGAGCACC AACCTGCAAG 12420 
AGAGCTTGAG GTCTAAGGAG TAAAAAGTCT AGAGTCGGGG CGGCCGGCCG CTTCGAGCAG 12480 
ACATGATAAG ATACATTGAT GAGTTTGGAC AAACCACAAC TAGAATGCAG TGAAAAAAAT 12540 
GCTTTATTTG TGAAATTTGT GATGGTATTG CTTTATTTGT AACCATTATA AGCTGCAATA 12600 
AACAAGTTAA CAACAACAAT TGCATTCATT TTATGTTTCA GGTTCAGGGG GAGGTGTGGG 12660 
AGGI MIMA AAGCAAGTAA AACCTCTACA AATGTGGTAA AATCGATAAG GATCCGTCGA 12720 
GCGGCCGC 12728 
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SEQ ID NO: 5 



rrrrnrrrrr rffAGACCCA CAGCCTGG6C A6CAGGAGGA CCCTGATGCT GCTGGCTCAG 

■■■■■■ 

TTGAGGTCTA AGGAGTAA 498 
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SEQ ID NO: 7 



rrrrrrmr TTTrATATTC ACTCTGTTGT ATTTCATCTC TTCTTGCCGA TGAAA GGATA 60 

Si Si sag = s= S 
k a = S IS Ssss s 



S S= rm^A S! SS 

JSSSK Sf Scramr nromw SSSSSs SS too 

sssili 
illliil 

CCCAGGGAAG TGCAGATCCA CGTGCATATT CTTAAftbAtt^ TTAAATTATT TTCGTAACGG 1140 

KS S « £S ™ ~? iS 

TGATGCCTGG ATGCCTGTTG CAGCTGTTTA CGG»CTGCC JTBCASTCM 

K ffiSSg ggg ™ TTA^rGAA 13B0 

mSSm SS Egg ggg « = ». 

S S S Sf §s s i 

sssss ass ggg = S S a 

TAAACAGTTA CATTTTTAJG MGATTACCA GCGCTGCTGA CTTTCTAAAC 

s i= sss gsg ss§s ss? ms 
= ssass ggg SS SI safe 

GTGTTGCCGG AGACCTGTAT TTCTCMCAA GGTAAGATGG ™CTAW.A £J»J 24Q(J 
TTAATACATT TTCAGCAGAA GTACTTAGTT AATCTCTACC ™GGGATC GTTU^ilai 
TTTTAGATGT TATACTTGAA ATACTGCATA ACTTTTAGCT TTCATGGGTT ™^ ^ 

ISSSSSi 
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she ssss sss SSSI 

=£ = m Wi H s i 

GAGAAAGTGA ACCTGGATTT CTTTG6CTA6 TGTTCTAMT CTGTAGTSAfa ^ m 
CCCGATTCCT TGAAAGGGCT CCAGCTTTM TGCTTCCAM jrej^* jycTCCACTA 3180 
GGCCACTGGT TATTTACTGC ATTATGTCTC AGTTTCGCAG CTMCCTGGL > ^ 
TTGAGCATGG ACTATAGCCT SGCTTCAGAG GCCAGblbAA QTGGGCAGCA 3300 

GTGCTGGGCT GTGGCTGGGG GGACTGTGGG GACTCCMGC TGAGCT Ibbb * 336Q 
CAGGGAAAAG TGTGGGTAAC TATTTTTAAG TACTGTGTTG CMALbiu^ 2m 
ACGTAGGGTG TGTACTCTCG MGATTAACA STGTGGGTTC AGTAATATAT GGAi ^ 
ACAGTGGAAG CATTCAAGGG TAGATCATCT MCGACACCA GAT^AAb 354Q 
AAGCGGTATC AGAAGAGCGA GGAAGGTAAG CAGTCTTCAT AJ£M HbLu GAGCATGTGC 3600 
GCAGTCTGGG AAAGTAGCAC CCCTTGAGCA GAG^ggSGA MTAATTCAb ^ 
TAGGAGAACT TTCTTGCTGA ATTCTACTTG CMGAGCTTT GATGLUbbb Q 
TTCTGCAGCA CCTGCAAGGC CCAGAGCCTG TGGTGAGCTG Gj^AAbA 378Q 
GTCCAAGCTT CAGCAGGTCA TTGTCTTTGC ^CTTCCCCC AbbAur jcTCAGAAAA 3840 
AACTGATGTC GAAGCCTCCT GTCCACTACC TGTTGCTGCA G^GACTbL ^ 39Q0 
AGAGAGCTAA CTCTATGCCA TAGTCTGAAG GTAAAATGGG T^AAAAAA JJJg^ 3%0 
AGGCAAAACC GGCTGCCCCA TGAGAAGAAA GCAGTGCTAA ALAl bb^ 4020 
AAGCCCCCAG GCAGTGTGAC AGGCCCCTCC TGCCACCTAG AGGCGGbAAb mJJ mQ 

GCCTAGGGCT CTGCCCGCGA AGTGCGTGTT TCTTlbb ibb bM . i *i TGGTTTGTAA 4140 
TTGAGATTTA GACACMGGG AAGCCTGAAA GGAGGTGTTG GGCACTATTT ^ mQ 
AGCCTGTACT TCAAATATAT ATTTTGTGAG GGACTbl AbL ^LJ^ GAGATCTTCT 4260 
AAGTTGCAAG AGATTGAAGG CTCAGTAGTT GAGAGGGTM CACbll WYA. ^ ^ ^ 
GAAACTACTG CTTCTAAACA CTTGTTTGAG TGGJGAGACC jrGGATA&GT ^ 
GTTACATGTC TGATGCACTT GCTTGTCCTT TTCCATCCAC AT CGTCACTTGG 4440 

CGCATTTGTC ACTTATCCCA TATCTGTCAT ATCTGACATA CCTCTCTCTT °™TBCT 450Q 
TCAGAAGAAA CAGATGTGAT MTCCCCAGC ^CCAAbl CACGAAATAG 4560 

TCTTTCCCTT TTTCCTGCTA AGTMGGATT jTCTCCTbbb 1 1 TGCTCTCAAG 4620 

TCTTCCTGCC TTACATTCTG GGCATTATTT CAAATATCTT TBSgTWgu ^ mQ 
TTTGTGTCTT CCTACTCTTA GAGTGAATGC TCmGAGTG AMGAGAAbb ^ mQ 
GTTGGCCGCA GTTCTCTGAT GAACACACCT CTGAATAATC GCCAAAbWb mQ 
TCTGAGGAAC GGGCAGCGTT TGCCTCTGAA AGCMGGAbL GCTACTTCTT 4860 

TTGCAACTGA TGGTGGAACT GGTCCTTAAA GCAGATTCCC TAGGTTCCCT GC^CTTU I 
TTCCTTCTTG GCACTCAGTT TATTTCTGAC AGACAMCAG CMCCCCAC 



GAAAGTATGT GGCTCTGCCT G^GTGTTA U*f i^gj- CAAGCTGTAA GGAACTTGGG 5040 
CGGGCACCAT TCATCCCAAA CAGGATCCTC ATTCATOT ™J£Ub^ C TCGCATTCC 5100 
CTCCAACCTC AAAACATTAA TTGGAGTACG AATGTAATTA AAACTbbAi_. ^ 516Q 
TAAGTCATTT AGTCTGGACT CT&CAGCATG TAGGTCGGCA GCTCCLAL U 522Q 
CACTGATGGA GGAGTAGTAA MATGGAGAC CGATTCAGAA CAALUAALbb 5280 
AAGAAACTGA TGGAAATAAT GCATGAATTG TGTbGTGGAC Al 1 1 1 1 1 m ^.^q mo 
CTACTTCAAA TGAGGTCGGA GAAGGTCAGT GTTTTATTAG CAGXC^TAAA ACCA^ ^ 
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CATAGCGGCT GAAGCTCCCC CCTGGCTGCC TGCCATCTCA GOGGAGTGC AGTGCCATTT 5460 

CCTTGGGGTT TCTCTCACAG CAGTMTGGG ACAATACTTC ACAAAAAI 1L " 558Q 

TGTCATGTGG GATCCCTACT GTKCCTCCT GGTTTTACGT TACCUUbA ^ 554Q 

CAGCGGTTTG GAAAGAGAAA AAGAATTTGG AMTAAAACA TGTCTALb. . 570Q 

CCAGCATTTT GGTTTTTAAT TATGTCMTA ACTGGCTTAG A I HbbHA^ 576Q 

TGGGTGTATT ACCGAGGAAC AMGGAAGGC ™TATAAAC TCMblUi . 582Q 
ACTGGCAAGC TGTCAAAAAC AAAMGGCCT TACCACCAM }™'?^ G cCTGCTTGTG 5880 

GCCAGCAGGG CCAGCACGAG GGATGGTGCA CTGCTGGCAC TATGCCACGb ^ 4Q 

ACTCTGAGAG CAACTGCTTT GGAAATGACA GCACTTGGTG CMTT1LUI ^ em 
TGCGTAGAGC GTGTGCTTGG CGACAGTTTT TCTAGTTAGG CCACTTCTTt ^ 

TCCTCATTCT CCTAAGCATG TCTCCATGCT GGTAATCCCA GTCAAGlbAA gl20 

ATGAATCCAT CACTGTAGGA TTCTCGTGGT GATCAAATCT ™bi ^ 6m 
ATGGAAGCTT ATTTATTTTT CGTTCTTCCA I Al LAbiL m v. c TAGCTTTAC G 6240 
ACCACAGCAA ATTAAAGGTG AAGGAGGCTG GTGGGATGAA GAGGGTC^ ^ 



TTCTTCCTTG CAAGGCCACA GGAAAA^I ^ ScCTTCTCA GACTCATCTT 6360 
AGTTCAGTCT CCTGCTGGGA GAGCTMCCG CATCTTATM LLtu. i /\AGGCTATCT 6420 
AGGACCAAAT AGGGTCTATC TGGGGTTTTT GTTCCTGCTG ™KAJ* TA&GCC 6480 
CACTATTTCA CTGCTCCCAC GGTTACMAC CMAGAjACA GCCTGAATn M ^ ^ 

ACATTACATA AATTTGACCT GGTACCMTA TTGTTCTCTA > ^ ^^0^ 6 600 
CTGTGTTTAA CCCCTTAAGG CATTCAGAAC AACTAGAATC ATAGAftU,b. 666Q 
AGGGGCCTTA AACATCATCC ATTTCCAACC CTCTGCCATG GGCIbt ccc 6720 
CTCAGGCTGC CCAGGGCCCC ATCCAGCCTG GCCTTGAGCA GAATTCTCTT 6780 

ACAGCTTCTC TGGGCAGCCT GTGCCAACAC CTCACCACTC TTTCCCGTTG 6840 

TTAACATCTA ATCTAAATCT CTTCTCTTTT ACTTTAAAtt CATTCUU. ^ mQ 

CTATCTGTCC MGAAATGTGr TATTGGTCTC CCTCCT^l Jj^ C CAGCTCCTT 6960 
GGCTGCAGTG AGGTCTCCCC ACAGCCTTCT CTTCTCCAbb li'^^ CCAACAGTT C 7020 
CAGCCTGTCT TCGTAGGAGA TCATCTTAGT GGCCCTCCTC TGGACCOT ^TT 7080 
CACGGCTTTC TTGTGGAGCC CCAGGTCTGG ATGCAG1AU '^'^ ^^^q 7140 

Sagagcaga tggggacaat cgcttacccc tccctgctgg ctggccctct TTTbA CTTTT 7200 

CCCAGGGTAC TGTTGGCCTT TCAGGCTCCC AGACCCCTlb UbA . u jaagCTTGTT 7260 

catccaccag aacccacgct tcctggttm ^cttctgcc cTCACTTcib ^ T 7320 

TCAGGAGACT TCCATTCTTT AGGACAGACT GTGTTACACC TACt lb£ CTAACAAAAA 7380 
ATATACATTT CAGTTCATGT JTCCTGTAAC AGGACAGAAT Alb. A GCACATAGT 7440 
. TACATGCAGA ATTCCTAGTG CGATCTCAGT AGGGTTTTCA TTGGGATCAG 7500 

CAATTTGCTG CAAGTACCTT CCAAGCTGCG GCCTCCCATA TGTGCTTCAG 7560 

TTACCTTTTG GGGTAAGCTT TTGTATCTGC AGAGACCCTG WBGllUbA TACM CTrCC 7620 
CTCTGCTCTG TTCTGACTGC ACGATTTTCT AGATCACCCA bMbn -nTGTGTTTG 7680 
TTGTCCTCCA TCCTTTCCCA GCTTGTATCT nGACMATA "gj^C TGGAACAGGA 7740 
CTTCAGCAGC CATTTAATTC TTCAGTGTCA TCTTGTTCTG TlbAi bu.Au m( , 7800 

TTTTCAGCAG TCTTGCAAAG AACATCTAGC TGAAAACTTT CTGCCATTCA ATAIU^ia 
CAGTTCTTCT TGTTTGAGGT GAGCCATAM ™CTAGAAC TTCGTCACTC ACMb^ ^ 

SEBSE ?S Sffi S ~ ™g * 
S A A SSS c cK SS5S Sgg^ca 8 ioo 
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TCAAGGGAGA GACA6CTGAA GGGTTGTGTA GCTCAATAGA ATTAAGAAAT AATAAAGCTG 8160 
TGTCAGACAG TTTTGCCTGA TTTATACAGG CACGCCCCAA GCCAGAGAGG CTGTCTGCCA 8220 
AGGCCACCTT. GCAGTCCTTG GTTTGTAAGA TAAGTCATAG GTAACTTTTC TGGTGAATTG 8280 
CGTGGAGAAT CATGATGGCA GTTCTTGCTG TTTACTATGG TAAGATGCTA AAATAGGAGA 8340 
CAGCAAAGTA ACACTTGCTG CTGTAGGTGC TCTGCTATCC AGACAGCGAT GGCACTCGCA 8400 
CACCAAGATG AGGGATGCTC CCAGCTGACG GATGCTGGGG CAGTAACAGT GGGTCCCATG 8460 
CTGCCTGCTC ATTAGCATCA CCTCAGCCCT CACCAGCCCA TCAGAAGGAT CATCCCAAGC 8520 
TGAGGAAAGT TGCTCATCTT CTTCACATCA TCAAACCTTT GGCCTGACTG ATGCCTCCCG 8580 
GATGCTTAAA TGTGGTCACT GACATCTTTA TTTTTCTATG ATTTCAAGTC AGAACCTCCG 8640 
GATCAGGAGG GAACACATAG TGGGAATGTA CCCTCAGCTC CAAGGCCAGA TCTTCCTTCA 8700 
ATGATCATGC ATGCTACTTA GGAAGGTGTG TGTGTGTGAA TGTAGAATTG CCTTTGTTAT 8760 
TTTTTCTTCC TGCTGTCAGG AACATTTTGA ATACCAGAGA AAAAGAAAAG TGCTCTTCTT 8820 
GGCATGGGAG GAGTTGTCAC ACTTGCAAAA TAAAGGATGC AGTCCCAAAT GTTCATAATC 8880 
TCAGGGTCTG AAGGAGGATC AGAAACTGTG TATACAATTT CAGGCTTCTC T6AATGCAGC 8940 
TTTTGAAAGC TGTTCCTGGC CGAGGCAGTA CTAGTCAGAA CCCTCGGAAA CAGGAACAAA 9000 
TGTCTTCAAG GTGCAGCAGG AGGAAACACC TTGCCCATCA TGAAAGTGAA TAACCACTGC 9060 
CGCTGAAGGA ATCCAGCTCC TGTTTGAGCA GGTGCTGCAC ACTCCCACAC TGAAACAACA 9120 
GTTCATTTTT ATAGGACTTC CAGGAAGGAT CTTCTTCTTA AGCTTCTTAA TTATGGTACA 9180 
TCTCCAGTTG GCAGATGACT ATGACTACTG ACAGGAGAAT GAGGMCTAG CTGGGAATAT 9240 
TTCTGTTTGA CCACCATGGA GTCACCCATT TCTTTACTGG TATTTGGAAA TAATAATTCT 9300 
GAATTGCAAA GCAGGAGTTA GCGAAGATCT TCATTTCTTC CATGTTGGTG ACAGCACAGT 9360 
TCTGGCTATG AAAGTCTGCT TACAAGGAAG AGGATAAAAA TCATAGGGAT AATAAATCTA 9420 
AGTTTGAAGA CAATGAGGTT TTAGCTGCAT TTGACATGAA GAAATTGAGA CCTCTACTGG 9480 
ATAGCTATGG TATTTACGTG TCTTTTTGCT TAGTTACTTA TTGACCCCAG CTGAGGTCAA 9540 
GTATGAACTC AGGTCTCTCG GGCTACTGGC ATGGATTGAT TACATACAAC TGTAATTTTA 9600 
GCAGTGATTT AGGGTTTATG AGTACTTTTG CAGTAAATCA TAGGGTTAGT AATGTTAATC 9660 
TCAGGGAAAA AAAAAAAAAG CCAACCCTGA CAGACATCCC AGCTCAGGTG GAAATCAAGG 9720 
ATCACAGCTC. AGTGCGGTCC CAGAGAACAC AGGGACTCTT CTCTTAGGAC CTTTATGTAC 9780 
AGGGCCTCAA GATAACTGAT GTTAGTCAGA AGACTTTCCA TTCTGGCCAC AGTTCAGCTG 9840 
AGGCAATCCT GGAATTTTCT CTCCGCTGCA CAGTTCCAGT CATCCCAGTT TGTACAGTTC 9900 
TGGCACTTTT TGGGTCAGGC CGTGATCCAA GGAGCAGAAG TTCCAGCTAT GGTCAGGGAG 9960 
TGCCTGACCG TCCCAACTCA CTGCACTCAA ACAAAGGCGA AACCACAAGA GTGGCTTTTG 10020 
TTGAAATTGC AGTGTGGCCC AGAGGGGCTG CACCAGTACT GGATTGACCA CGAGGCAACA 10080 
TTAATCCTCA GCAAGTGCAA TTTGCAGCCA TTAAATTGAA CTAACTGATA CTACAATGCA 10140 
ATCAGTATCA ACAAGTGGTT TGGCTTGGAA GATGGAGTCT AGGGGCTCTA CAGGAGTAGC 10200 
TACTCTCTAA TGGAGTTGCA TTTTGAAGCA GGACACTGTG AAAAGCTGGC CTCCTAAAGA 10260 
GGCTGCTAAA CATTAGGGTC AATTTTCCAG TGCACTTTCT GAAGTGTCTG CAGTTCCCCA 10320 
TGCAAAGCTG CCCAAACATA GCACTTCCAA TTGAATACAA TTATATGCAG GCGTACTGCT 10380 
TCTTGCCAGC ACTGTCCTTC TCAAATGAAC TCAACAAACA ATTTCAAAGT CTAGTAGAAA 10440 
GTAACAAGCT TTGAATGTCA TTAAAAAGTA TATCTGCTTT CAGTAGTTCA GCTTATTTAT 10500 
GCCCACTAGA AACATCTTGT ACAAGCTGAA CACTGGGGCT CCAGATTAGT GGTAAAACCT 10560 
ACTTTATACA ATCATAGAAT CATAGAATGG CCTGGGTTGG AAGGGACCCC AAGGATCATG 10620 
AAGATCCAAC ACCCCCGCCA CAGGCAGGGC CACCAACCTC CAGATCTGGT ACTAGACCAG 10680 
GCAGCCCAGG GCTCCATCCA ACCTGGCCAT GAACACCTCC AGGGATGGAG CATCCACAAC 10740 
CTCTCTGGGC AGCCTGTGCC AGCACCTCAC CACCCTCTCT GTGAAGAACT TTTCCCTGAC 10800 
ATCCAATCTA AGCCTTCCCT CCTTGAGGTT AGATCCACTC CCCCTTGTGC TATCACTGTC 10860 
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T aaaacttcat TCTCCTCCTT TTTGGAAGGT TGCAATGAGG TCTCCTTGCA 10920 
JJrrnTTCT CTTCTGCAGG ATGMCMGC CCAGCTCCCT CAGCCTGTCT TTATAGGAGA 10980 

Sag aSgg ccctcctctg gacccgcfcc m^gctcca iio4 

r attttttpt GTArTRGGGG CCCCAGGCCT GAATGGAGTA CTCCAGATGG GGCCTCAAAA lllUU 
S AAGAGGGACA AT™ TCACCCTGCT GGCCAGCCCT. CTTCTGATGG 11160 
AGCCCTGGAT ACAACTGGCT TTCTGAGCTG CAACHCTCC TTATCAGTTC CACTATTAAA 11220 
ACAGGAACAA TACAACAGGT GCTGATGGCC AGTGCAGAGT TTTTCACACT TCTTCATTTC 11280 

«S SS SEES ™ B iSS8 

SccS CACTGCTTCA GATGCTGTTT AATAGCACAG TATGACCAAG 11460 
Srrw ?I^CACAA ACAATGTGTT GCATCCTTCA GCACTTGAGA AGAAGAGCCA 11520 
TTGCACCTAT ^ACACAA AtMlblbl^ jxcTGCCAAT TAAAACTTGT TTATCTACCA 11580 

iSSSfS TAT^CTGTT AGTAGTGGTA CACTGATGAT GAACAATGGC TATGCAGTAA 11640 
SSSJS StI?TG CMCAGACTA TMAATTCCT CTGTGGCTTA GCCAATGTGG 11700 
?£™S StCTATaH AMTTTGGCA AGTTTAGAGC AATGTTTGAA GTGTTGGGAA 11760 
SS KSSI ScSg CMCTGTAGA AGAGAGGAAT CMAAGGGGG 11820 

IS SSKES Sffi S SSSSS iSSo 



TAGGG 11945 
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SEQ ID NO: 8 

AAAGTCTAGAGTC6G6GCGGCCG6CCGCTTCGAGCAGACATGATAA6ATACA1T6ATGAG 60 

TnGGACAMCCACMCTAGMTGCAGTGAAAAAMTGCTTTATTTGTGAAATTTGTGAT 120 

GCTATTGCTTTATTTGTAACCATTATAAGCTGCAATAAAC AAGTTAA CAACAACAATTGC 180 

ATTCATTTTATGTTTCAGGTTCAGGGGGAGGTGTGGGAGGTTTTTTAAAGCAAGTAAAAC 240 
CTCTACAAATGTGGTAAAATCGATAAGGATCCGTCGAGCGGCCGC 285 
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SEQ ID NO; 9 

1 CGCGTGGTAGGTGGCGGGGGGTTCCCAGGAGAGCCCCCAGCGCGGACGGC • 
AGCGCCGTCACTCACCGCTCCGTCTCCCTCCGCCCAGGGTCGCCTGGCGC 
AACCGCTGCAAGGGCACCGACGTCCAGGCGTGGATCAGAGGCTGCCGGCT 
GTGAGGAGCTGCCGCGCCCGGCCCGCCCGCTGCACAGCCGGCCGCTTTGC 

200 GAGCGCGACGCTACCCGCTTGGCAGTTTTAAACGCATCCCTCATTAAAAC 
GACTATACGCAAACGCCTTCCCGTCGGTCCGCGTCTCTTTCCGCCGCCAG 
GGCGACACTCGCGGGGAGGGCGGGAAGGGGGCCGGGCGGGAGCCCGCGGC 
CAACCGTCGCCCCGTGACGGCACCGCCCCGCCCCCGTGACGCGGTGCGGG 

40 0 CGCCGGGGCCGTGGGGCTGAGCGCTGCGGCGGGGCCGGGCCGGGCCGGGG 
CGGGAGCTGAGCGCGGCGCGGCTGCGGGCGGCGCCCCCTCCGGTGCAATA 
TGTTCAAGAGAATGGCTGAGTTCGGGCCTGACTCCGGGGGCAGGGTGAAG 
GTGCGGCGCGGGCGGAGGGACGGGGCGGGCGCGGGGCCGCCCGGCGGGTG 

600 CCGGGGCCTCTGCGGGCCCGCCCGGCTCGGGCTGCTGCGGCGCTTACGGG 
CGCGCTTCTCGCCGCTGCCGCTTCTCTTCTCTCCCGCGCAAGGGCGTCAC 
CATCGTGAAGCCGGTAGTGTACGGGAACGTGGCGCGGTACTTCGGGAAGA 
AGAGGGAGGAGGACGGGCACACGCATCAGTGGACGGTTTACGTGAAGCCC 

800 TACAGGAACGAGGTAGGGCCCGAGCGCGTCGGCCGCCGTTCTCGGAGCGC 
CGGAGCCGTCAGCGCCGCGCCTGGGTGCGCTGTGGGACACAGCGAGCTTC 
TCTCGTAGGACATGTCCGCCTACGTGAAAAAAATCCAGTTCAAGCTGCAC 
GAGAGCTACGGGAATCCTCTCCGAGGTGGGTGTTGCGTCGGGGGGTTTGC 
1000 TCCGCTCGGTCCCGCTGAGGCTCGTCGCCCTCATCTTTCTTTCGTGCCGC 
AGTCGTTACCAAACCGCCGTACGAGATCACCGAAACGGGCTGGGGCGAAT 
nGAAATCATCATCAAGATATTTTTCATTGATCCAAACGAGCGACCCGTA 
AGTACGCTCAGCTTCTCGTAGTGCTTCCCCCGTCCTGGCGGCCCGGGGCT 

1200 GGGCTGCTCGCTGCTGCCGGTCACAGTCCCGCCAGCCGCGGAGCTGACTG 
AGCTCCCTTTCCCGGGACGTGTGCTCTGTGTTCGGTCAGCGAGGCTATCG 
GGAGGGCTTTGGCTGCATTTGGCTTCTCTGGCGCTTAGCGCAGGAGCACG 
TTGTGCTACGCCTGAACTACAGCTGTGAGAAGGCCGTGGAAACCGCTCTC 

1400 AAACTGATTTATTGGCGAAATGGCTCTAAACTAAATCGTCTCCTCTCTTT 
GGAMTGCTTTAGAGAAGGTCTCTGTGGTAGTTCTTATGCATCTATCCTA 
AAGCACTTGGCCAGACAATTTAAAGACATCAAGCAGCATTTATAGCAGGC 
ACGTTTAATAACGAATACTGAATTTAAGTAACTCTGCTCACGTTGTATGA 

1600 CGfrTTATTnCGTATTCCTGAAAGCCATTAAMTCCTGTGCAGTTGTTTA 
GTMGMCAGCTGCCACTGTTnGTATCTAGGAGATMCTGGTGTTTCCC 
TACAGTTCTCMGCTGATAAAACTCTGTCTTTGTATCTAGGTAACCCTGT 
ATCACTTGCTGMGCTTTTTCAGTCTGACACCMTGCAATCCTGGGAAAG 

1800 AAAACTGTAGTTTCTGAATTCTATGATGAAATGGT ATGAAAATTTTAATG 
TCMCCGAGCCTGACTTTATnAAAAAAAATTATTGATGGTGCTGTGTAT 
' TTTGGTCCTTCCTTAGATATTTCAAGATCCTACTGCCATGATGCAGCAAC 
TGCTAACGACGTCCCGTCAGCTGACACTTGGTGCTTACAAGCATGAAACA 
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2000 GAGTGTAAGTGCAAAAT6A6GATACCTTCGCCGACCGTCATTCACTACTA 
ATGTTTTCTGTGGGATCTGATCGTACAGTGAGTTTGGCTGTGTGAAATTT 
GAATAGCTTGGTATTGGCAGTGATGACGTGATCGATGCCTTGCTTATCAT 
GTTTGAMTGMGTAGMTAMTGCAGCCTGCTTTATTTGAGATAGTTTG 

2200 GTTCATnTATGGAATGCAAGCAAAGATTATACTTCCTCACTGAATTGCA 
CTGTCCAAAGGTGTGAAATGTGTGGGGATCTGGAGGACCGTGACCGAGGG 
ACATTGGATCGGTATCTCCCAmCTTTTGCTGTTACCAGTTCAGATTTT 
CTTnCACCTAGTCTTTAATTCCCAGGGTTTTG 1 1 1 1 1 1 CCTTGGTCATA 

2400 GTTTTTGTTTTTCACTCTGGCAAATGATGTTGTGAATTACACTGCTrCAG 
CCACAAAACTGATGGACTGAATGAGGTCATCAAACAAACTTTTCTTCTTC 
CGTATTTCC 1 1 1 1 1 I I I CCCCCACTTATCATTTTTACTGCTGTTGTTGAG 
TCTGTMGGCTAAMGTMCTGTTTTGTGCTTTTTCAGGACGTGTGCTTT 

2600 CCAMTTACTGCCA CATATAT AMGAMGGTTGGAATTTTAAAGATAATT 
CATGTTTCTTCTTC I 1 1 1 1 1 GC C ACC ACAGTTGC AGATCTTGAAGTAAAA 
ACCAGGGAAMGCTGGMGCTGCCAAAAAGAAAACCAGTTTTGAAATTGC 
TGAGCTTAAAGAAAGGTTAAAAGCAAGTCGTGAAACCATCAACTGCTTAA 

2800 AGAGTGAAATCAGAAAACTCGAAGAGGATGATCAGTCTAAAGATATGTGA 
TGAGTGTTGACTTGGCAGGGAGCCTATAATGAGAATGAAAGGACTTCAGT 
CGTGGAGTTGTATGCGTTCTCTCCAATTCTGTAACGGAGACTGTATGAAT 
TTCATTTGCAMTCACTGCAGTGTGTGACMCTGACTTTTTATAAATGGC 

3000 AGAAAACAAGAATGAATGTATCCTCATnTATAGTTAAAATCTATGGGTA 
TGTACTGGTTTATTTCAAGGAGAATGGATCGTAGAGACTTGGAGGCCAGA 
TTGCTGCTTGTATTGACTGCATTTGAGTGGTGTAGGAACATTTTGTCTAT 
GGTCCCGTGTTAGTTTACAGMTGCCACTGTTCACTGTTTTGTTTTGTAT 

3200 TTTACTTTTTCTACTGCM CGTCM GGTTTTAAAAGTTGAAAATAAAACA 
TGCAGGI 1 1 1 1 1 1 IAAATAI Mill I GTCTCTATCCAGTTTGGGCTTCAA 
GTATTATTGTTAACAGCAAGTCCTGATTTAAGTCAGAGGCTGAAGTffTAA 
TGGTATTCAAGATGCTTAAGTCTGTTGTCAGCAAAACAAAAGAGAAAACT 

3400 TCATAAAATCAGGAAGTTGGCATTTCTAATAACTTCTTTATCAACAGATA 
AGAGTTTCTAGCCCTGCATCTACTTTCACTTATGTAGTTGATGCCTTTAT 
ATTTTGTGTGTTTGGATGCAGGAAGTGATTCCTACTCTGTTATGTAGATA 
TTCTATTTMCACTTGTACTCTGCTGTGCTTAGCCTTTCCCCATGAAAAT 

3600 TCAGCGGCTGTAAATCCCCCTCnCTTTTGTAGCCTCATACAGATGGCAG 
ACCCTCAGGCTTATAMGGCTTGGGCATCTTCTTTACTGCTTTGAGATTC 
TGTGTTGCAGTAACCTCTGCCAGAGAGGAGAAAAGCCCCACAAACCTCAT 
CCCCTTCTTCTATAGCAATCAGTATTACTAATGCTTTGAGAACAGAGCAC 

3800 TGG^rTGAMCGTTTGATMTTAGCATTTMCATGGCTTGGTAAAGATGC 
AGAACTGAAACAGCTGTGACAGTATGAACTCAGTATGGAGACTTCATTAA 
GACAMCAGCTGTTAAAATCAGGCATGTTTCATTGAGGAGGACGGGGCAA 
CTTGCACCAGTGGTGCCCACACAAATCCTTCCTGGCGCTGCAGACCAATT 
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4000 ™^ 

4200 CCA(^GCTTmGGTAGMGCOT 

ACMGTAGAGAMCTGGCMCGGAGCTAT^TATCGM^ 
A Ar aa ARTTAATCGTCTTTTMGGTTTGGTTTATTCATTAMTTTGC 1 1 1 
STOKlSl^CSrGCTGTCTTC^aA^AG 
CTCTSTKmGTGCTCTCTWCTCACC«^^fCGGGATT 

4800 MWnwraS^^ 

^ouu ^„ rgmTGgr G . MiiiMIMl A TTTATTATTTCTTTGGGGTTCT 
TT^rATrn^ATCTTTCTC 

*nnn T^C/^(XTCAGAACTGC^ 



5600 CAGCACTTCCACCTGAAGAGATCMTOC^TCTTCAATGTCTAGnCT 
MCACTTMAAACCTOCATCTTAT^ 
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SEQ ID NO: 10 

1 TRrrGCCTTCTTTGATATTCACTCTGTTGTATTTCATCTCTTCTTGCCGA 

TTCTTCTGATCAGTGXTTTTATMCT 
151 TGTGTCCTnGTCTTGGGAGACAMGCCCACAGCAGG 
GGTGGCAGCTCAGTGACAGGAGAGffTTTrnTGC 
1 1 1 1 1 1 1 1 1 rMGTMGGTGTTCTTTTTTCTTAGTAMTTTTCTACTGGA 

m ===== 

SO! 

TAAATCTTTTCCTAGTTMCATGTTGATMCTTCGGATTTACATGTTGTA 

751 ACGTMTTCCTGATTTCCTTTTTTTTTA^ 
GTCAMCAGACTTCACTCCTATTTTTATn^ 
TRTrff^GGTTCTTGTGTTGTMGGATACAGCCTTAAATTTCCTAGAGCG 

901 SSgtS^ 
TTTCA^^A^GITG^^ 

1201 tSKccto^^ 

rATT(^AGATAGGGGTGGGGTGCTTTGTGTCGTGTTCCCACACGCTGCCA 

1501 Ssssss 

rArrrTrTTRTGTACTGTATTTTGTMTAGAAMTATTTTAMCT 
1801 SmSt^^ 

^i^nMMAAAAAAAAAAAMGTMTA^ 

1951 mrrEE^^ 

2101 1™rcSfiAA^^ 
2101 ffiSEro^^ 

GCCCMCTCCAMGGATnTATATGTGTATATAGTAAGCAGTlTCCTGAT 
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2251 TCCAGCAGGCCAAA6AGTCTGCTGAATGTTGTGTT6CCGGAGACCTGTAT 
TTCTCAACAAGGTAAGATGGTATCCTAGCMCTGCGGATTTTAATACATT 
TTCAGCAGAAGTACTTAGTTAATCTCTACCTTTAGGGATCGTTTCATCAT 

2401 TTTTAGATGTTATACTTGAMTACTGCATMCTTTTAGCTTTCATGGGTT 
CCTTTTTTTCAGCCTTTAGGAGACTGTTMGCMTTTGCTGTCCAACTTT 
TGTGTTGGTCTTAAACTGCAATAGTAGTTTACCTTGTATTGAAGAAATAA 

2551 AGACCATTTTTATATTAAAAMTACTTrTGTCTGTCTTCATTTTGACTTG 
TCTGATATCCTTGCAGTGCCCATTATGTCAGTTCTGTCAGATATTCAGAC 
ATCAAAACTTAACGTGAGCTCAGTGGAGTTACAGCTGCGGTTTTGATGCT 

270 1 GTTATTATTTCTGAAACTAGAAATGATGTTGTCTTCATCTGCTCATCAAA 
CACTTCATGCAGAGTGTAAGGCTAGTGAG AAATGCAT ACATTTATTGATA 
CTTTTnAMGTCMCTTTTTATCAGATTTTTTTTTCATTTG 

2851 TTGTTTTCTAGACTGCATAGCTTCTGAATCTGAAATGCAGTCTGATTGGC 
ATGAAGAAGCACAGCACTCTTCATCTTACTTAAACTTCATnTGGAATGA 
AGGAAGTTAAGCAAGGGCACAGGTCCATGAAATAGAGACAGTGCGCTCAG 

3001 GAGAAAGTGAACCTGGATTTCTTTGGCTAGTGTTCTAAATCTGTAGTGAG 
GAAAGTAAC AC C C GATTCCTTGAAAGGGCTC CAGCTTTAATGCTTCC AAA 
TTGAAGGTGGCAGGCAACTTGGCCACTGGTTATTTACTGCATTATGTCTC 

3151 AGTTTCGCAGCTAACCTGGCTTCTCCACTATTGAGCATGGACTATAGCCT 
GGCTTCAGAGGCCAGGTGAAGGTTGGGATGGGTGGAAGGAGTGCTGGGCT 
GTGGCTGGGGGGACTGTGGGGACTCCAAGCTGAGCTTGGGGTGGGCAGCA 

3301 CAGGGAAAAGTGTGGGTAACTATTTTTAAGTACTGTGTTGCAAACGTCTC 
ATCTGCAAATACGTAGGGTGTGTACTCTCGAAGATTAACAGTGTGGGTTC 
AGTAATATATGGATGAATTCACAGTGGAAGCATTCAAGGGTAGATCATCT 

3451 AACGACACCAGATCATCAAGCTATGATTGGAAGCGGTATCAGAAGAGCGA 
GGAAGGTAAGCAGTCTTCATATGTTTTCCCTCCACGTAAAGCAGTCTGGG 
AAAGTAGCACCCCTTGAGCAGAGACAAGGAAATAATTCAGGAGCATGTGC 

3601 TAGGAGMCTTTCTTGCTGAATTCTACTTGCAAGAGCTTTGATGCCTGGC 
TTCTGGTGCCTTCTGCAGCACCTGCAAGGCCCAGAGCCTGTGGTGAGCTG 
GAGGGAAAGATTCTGCTCAAGTCCAAGCnCAGCAGGTCATrGTCTTTGC 

3751 TTCTTCCCCCAGCACTGTGCAGCAGAGTGGAACTGATGTCGAAGCCTCCT 
GTCCACTACCTGTTGCTGCAGGCAGACTGCTCTCAGAAAAAGAGAGCTM 
CTCTATGCCATAGTCTGMIaGTAAMTGGGTTTTAAAAAAGAAAACACAA 

3901 AGGCAAAACCGGCTGCCCCATGAGAAGAAAGGAGTGGTAAACATGGTAGA 
AAAGGTGCAGAAGCCCCCAGGCAGTGTGACAGGCCCCTCCTGCCACCTAG 
AGGCGGGMCMGCmCCTGCCTAGGGCTCTGCCCGCGAAGTGCGTGTT 

4051 TCTTTGGTGG GI 1 1 IGI 1 1 G GCGTTTGGTnTGAGATTTAGACACAAGGG 
MGCCTGAMGGAGGTGTTGGGCACTATTTTGGTTTGrrAAAGCCTGTAGT 
TCAMTATATATTTTGTGAGGGAGTGTAGCGAATTGGCCAATTTAAAATA 

4201 MGTTGCMGAGATTGMGGCTGAGTAGTTGAGAGGGTAACACGTTTAAT 
GAGATCTTCTGAMCTACTGCTTCTAAACACTTGTTTGAGTGGTGAGACC 
TTGGATAGGTGAGrTGCTCTTGTTACATGTCTGATGCACTTGCTTGTCCTT 

4351 TTCCATCCACATCCATGCATTCCACATCCACGCATTTGTCACTTATCCCA 
TATCTGTCATATCTGACATACCTGTCTCTTCGTCACTTGGTCAGAAGAAA 
CAGATGTGATMTCCCCAGCCGCCCCAAGTTTGAGAAGATGGCAGTTGCT 

4501 TCmCCCTTmCCTGCTMCTMGGATTTTCTCCTGGCTTTGACACCT 
CACGAMTAGTCTTCCTGCCTTACATTCTGGGCATTATTTCAAATATCTT 
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. TGGAGTGCGCTGCTCTCMGTnGTGTCTTCCTACTCTTAGAGTGMTGC 
4651 TCTTAGAGTGAAAGAGAAGGAAGAGAAGATGTTGGCCGCAGTTCTCTGAT 
GAACACACCTCTGMTMTGGCCAMGGTGGGTGGGTnCTCTGAGGAAC 
GGGCAGCGTTTGCCTCTGAAAGCAAGGAGCTCTGCGGAGTTGCAGTTATT 

4801 SKSSS^^ 

CCACCCCCACTGCAGGCTTAGAMGTATGTGGCTCTGCCTO 
4951 CAGCTCTGCCCTGGTGAAAGGGGATTAAAACGGGCACCATTCATCCCAAA 
CAGGATCCTCATTCATGGATCMGCTGTMGGMCnGGGCTCCMCCTC 
AAAACATTAATTGGAGTACGAATGTAATTAAMCTGCATTCTCGCATTCC 
5101 TAAGTCATTTAGTCTGGACTCTGCAGCATGTAGGTCGGCAGCTCCCACTT 
TCTCAMGACCACTGATGGAGGAGTAGTAAAMTGGAGACCGATTCAGAA 
CAACCAACGGAGTGTTGCCGAAGAAACTGATGGAAATAATGCATGAATTG 

5?51 TCTGGTGGAC A I I I 1 1 1 1 I A AATACATAAACTACTTCAAATGAGGTCGGA 
GMGGTCAGTGTTTTATTAGCAGCCATAAMCCAGGTGAGCGA 
TT^^TCTACMGAAAMCGATTCTGAGCTCTGCGTMGTATMGTTCTC 

5401 CATAGCGGCTGAAGCTCCCCCCTGGCTGCCTGCCATCTCAGCTGGAGTGC 

5 A^^ 

ACAAAMnCTTTCTTTTCCTGTCATGTGGGATCCCTACTGTGCCCTCCT 

5551 GGTTTTACGTTACCCCCTGACTGTITCCATTCAGCGGTnGGAMGAGAAA 
AAGAATTTEKyMT^ 

GGTTTTTMTTATGTCMTMCTGGCTTAGATn 
570 1 TGGGTGTATTACCGAGGMCAMGGMGGCTTATATAMCTCMGTCTTT 
TAT^AGAGAACTGGCAAGCTGTCAAAAACAAAAAGGCC"rrACCACCAAA 

HAAGTGAATAGCCGCTATA^^ 

5851 CTgIWaCTAT&CCACGGCC^^ 

GGAMTGACA^ACnGGTGCMmCCmGmCAGAA^GTA^GC 

GTGTGCTTGGCGACAGnTTTTCTAGTTAGGCCACTTCTTTTn 
fiOOl TCCTCATTCTCCTAAGCATGTCTCCATGCTGGTAATCCCAGTCAAGTGAA 

TTGTGTGAGGTCTATAAAATATGGMGCmmATTTTTC^ 

6151 TATO«rcrnr^ 

MGGAGGCTGGTGGGATGMGAGGGTCmTAGC^ 
CMGGCCACAGGAAMTGCTGAGAGCTGTAGAATACAGCCT 
6301 AGTTCAGTCTCCTGGTGGGACAGCTMCCGCATCTTATM 

GACTCATCTTAGGACCAMTAGGGTCTATCTGGGGTTTTTGTTCCTGCTG 
^CCTCCTGGAAG^TATCTCAGTATTTCACTGCTCCCACGG^ 

6451 SESSE^^ 

6601 

CCTCCAGGGATGGGGCACCCACAGCTTCTCTGGGCAGCCTGT^ 

6751 crSccScnnoCT 

CTTCTCTTTTAGTrrAMGCCATTCCTCTTTTTCCCGTTGCTATCTGTCC 
6901 GGCTGCAGTGAGCTCTCCCCACAGCCTTCTCTTCTCCAGGCTGMCMG^ 
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CCA6CTCCTTCAGCCTGTCTTCGTAGGAGATCATCTTAGTGGCCCTCCTC 
TGGACCCATTCCAACAGTTCCACGGCTTTCTTGTGGAGCCCCAGGTCTGG 

7051 ATGCAGTACTTCAGATGGGGCCTTACAAAGGCAGAGCAGATGGGGACAAT 
CGCTTACCCCTCCCTGCTGGCTGCCCCTGTTTTGATGCAGCCCAGGGTAC 
TGnGGCCTTTCAGGCTCCCAGACCCCTTGCTGATTTGTGTCAAGCTTTT 

7201 CATCCACCAGMCCCACGCnCCTGGTTMTACTTCTGCCCTCACTTCTG 
TMGCTTGTTTCAGGAGACTTCCATTCTTTAGGACAGACTGTGTTACACC 
TACCTGCCCTATTCTTGCATATATACATTTCAGTTCATGTTTCCTGTAAC 

7351 AGGACAGMTATGTATTCCTCTMCAAAMTACATGCAGMTTCCTAGTG 
CCATCTCAGTAGGGTTTTCATGGCAGTATTAGCACATAGTCAATTTGCTG 
CAAGTACCTTCCAAGCTGCGGCCTCCCATAAATCCTGTATTTGGGATCAG 

7501 TTACCTTTTGGGGTAAGCTTTTGTATCTGCAGAGACCCTGGGGGTTCTGA 
TGTGCTTCAGCTCTGCTCTGTTCTGACTGCACCATTTTCTAGATCACCCA 
GTTGTTCCTGTACMCTTCCTTGTCCTCCATCCTTTCCCAGCTreTATCT 

7651 TTGACAMTACAGGCCTATTmGTGm^ 

TTCAGTGTCATCTTGTTCTGnGATGCCACTGGAACAGGATTTTCAGCAG 

TCTTGCAMGAACATCTAGCTGAAMCTTTCTGCCATTCAATATTCTTAC 
7801 CAGTTCTTCTTGTTTGAGGTGAGCCATAAATTACTAGAACTTCGTCACTG 

ACA^T^ATGCATTTTATO 

ACAGACACGCACATATTnGCTGGGATTTCCACAGTGTCTCTGTGTCCT^ 
795 1 CACATGGTTTTACTGTCATACTTCCG TTATAAC CTTGGCAATCTGCCCAG 

CTGCCCATCACMGAAMGAGAnCCTTTTTTATTACTTCT^ 
ATAAACAAAATGTGAGAAGCCCAAACAAGAACTTGTGGGGCAGGCTGCCA 

8101 TCMGGGAGAGACAGCTGAAGGGTTGTGTAGCTCAATAGMTTM 

MTAMGCTGTGTCAGACAGTTnGCCTGATTTATACAGGQ^GCCCCAA 
GCCAGAGAGGCTGTCTGCCAAGGCCACCTTGCAGTCCTTGGTTTGTAAGA 

8251 TMGTCATAGGTMCTTTTCTGGTGAATTGCGTGGAGAATCATGATGGCA 
GTTCTTGCTGTnACTATGCTMGATGCTAAMTAGGAGACAGCAMGTA 
ACACTTGCTGCTGTAGGTGCTCTGCTATCCAGACAGCGATGG^CTCGCA 

8401 CACCMGATGAGGGATGCTCCCAGCTGACGGATGCTGGGGCAGTMCAGT 
G^TCCCATGCTGCCTGCTCATTAGCATCACCTCAGCCCTCACCAGCCCA 

TCAGAAGGATCATCCCMGCTGAGGAMGTTGCTC^ 
8551 TCAMCCmGGCCTGACTGATGCCTCCCGGATGCnAMTGT^ 
GACATCTTTATTTTTCTATGATTTCAAGTCAGAACCTCCGGATCAGGAGG 
GMCACATAGTGGGMTGTACCCTCAGCTCCMGGCCAGATCTTCCTTCA 




8701 

Ulll II 



CCTTTGTTATTT 1 1 ILI lU/.lubiui umuuwwa. m ■ 
AAMGAAMGTGCTCTTCTTGGCATGGGAGGAGTTGTCACACTTGCAAAA 

8851 TAMGGATGCAGTCCCAAATGnCATMTCTCAGGGTCTGAAGGAG^TC 
a a ptctctat ap a atttp ARRnTTCTCTGAATGCAGCTTTTGAAAGC 



AGAAACTGTGTA I ALAA 1 1 1 law*. ii u i u i a«« . ■ 1'^";»a 
TGTTCCTGGCCGAGGCAGTACTAGTCAGMCCCTCGGAMCAGGMCAAA 

9001 TGTCTTCMGGTGCAGCAGGAGGAAACAC(^ 

TAACCACTGCCGCTGAAGGAATCCAGCTCCTGTTTGAGCAGGTGCTGCAC 

ACTCCCACACTGAMCMCAGTTCATTTTTATAGGACTTCCAGGMGGAT 

9151 CTTCTTCTTMGCTTCTTMTTATGGTACATCTCCAGTTGGCAGATGACT 

ATGACTACTGACAGGAGAATGAGGAACTAGCTGGGM^ 
CCACCATGGAGTCACCCATTTCTnACTGGTATTTGGAAATAATAATTCT 
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9301 GMTTGCAMGCAGGAGTTAGCGMGATCTTCATTTCTTCCATGTT^TG 
ACAGCACAGTTCTGGCTATGAMGTCTGCTTACMGGAAGAGGATMAAA 
9401 ' TCATAGGGATMTAAATCTAAGTTTGAAGACAATGAGGTnTAGCTGCAT 
TTGACATGAAGAAATTGAGACCTCTACTGGATAGCTATGGTATTTACGTG 
TCTTTTTGCTTAGTTACTTATTGACCCCAGCTGAGGTCAAGTATGAACTC 
9551 AGGTCTCTCGGGCTACTGGCATGGATTGATTACATACMCTGTAATTTrA 
GCAGTGATTTAGGGTTTATGAGTACTTTTGCAGTAAATCATAGGGTTAGT 
AATGTTAATCTCAGGGAAAAAAAAAAAAAGCCAACCCTGACAGACATCCC 
9701 AGCTCAGGTGGAAATCAAGGATCACAGCTCAGTGCGGTCCCAGAGAACAC 
AGGGACTCTTCTCTTAGGACCTTTATGTACAGGGCCTCAAGATAACTGAT 
GTTAGTC AGAAGACTTTC CATTCTGGCC AC AGTTC AGCTGAGGC AATC CT 
9851 GGMTTTTCTCTCCGCTGCACAGTTCCAGTCATCCCAGTTTGTACAGTTC 



TGGCACTTTTTGGGTCAGGCCGTGATCCAAGGAGCAGAAGTTCCAGCTAT 
GGTCAGGGAGTGCCTGACCGTCCCAACTCACTGCACTCAAACAAAGGCGA 

10001 MCCACMGAGTGGCTTTTGTTGAMTTGCAGTGTGGC^CCAG^ 

CACCAGTACTGGATTGACCACGAGGCAACATTAATCCTCAGCAAGTGCAA 
T^GCAGCCATTAAATTGAACTAACTGATACTACAATGCAATCAGTATCA 
10151 ACAAGTGGTTTGGCTTGGAAGATGGAGTCTAGGGGCTCTACAGGAGTAGC 
iUlbl ALAAbiuui i muo i . imcA . r p.o RarA r T RTGAAAAGCTGGC 



TACTCTCTMTGGAGTTGCATTTTGAAGCAGGACACTGTGAAAAGCTGGC 
CTCCTAMGAGGCTGCTAMCATTAGGGTCMTTTTCCAGTGCACTTTCT 

10301 GMGTGTCTGCAGTTCCCCATGCAAAGCTGC^^ 

TTGAATACAATTATATGCAGGCGTACTGCTTCTTGCCAGCACTGTCCTTC 

TCAMTGMCTCMCAMCMTTTCAAAGTCTAGTAGAAAGTMCAAGCT 

10451 TTGMTGTCATTAAAAAGTATATCTGCTTTCAGTAGTTCAGCTTATTTAT 

10451 SHEE^^ 

GGfrAAAACCTACTTTATACAATCATAGAATCATAGAATGGCCTGGGTTGG 
10601 AAGGGACCCCAAGGATCATGAAGATCCMCACCCCCKCA£Affi^GGK 

ACCTGGCCATGMCACCTCCAGGGATOG^TCC^ACM 

10 ^ fflg^^ 

TATCACTGTCTACTCTTGTAAAM 
1 0901 TGCAATGAGGTCTCCTTGCAGCCTTCTTCTCTTCTGCAGGATGAACAAGC 

11061 ssssss 

C^CTGATGGAGCCCTGGATACMCTGGCTTTCTGAGCTGCAACTTCTCC 
11201 TTATCAGTTCCACTATTAAAACAGGAACAATACAACAGGTGCTGATGGCC 
AGTGCAGAG^CACACmTTCAmCGGTAGATC™ 

CGTTCAAG^GTK 
11351 CTGATACCTCACCCCACCTGCCACTGAATGGCTCCATGGCCCCCTGCAGC 

CAGGGCCCTGATGMCCCGGCACTGCTTCAGATGCTGTTTMT^ 
TATGACCMGTTGCACCTATGMTACACAMCMTGTGTTGCATCCTTCA 
11501 GCACTTGAGMGMGAGCCAAATTTGCATTGTCAGGAAATGGTTTAGTM 
^CTGCC/^^AAAACTTGTTTATCTACCATGGCTGTlTrrATGGCTG^ 
AGTAGTGGTACACTGATGATGAACAATGGCTATGCAGTAAAATCAAGACT 
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11651 GTAGATATTGCAACAGACTATAAAATTCCTCTGTGGCTTAGCCAATGTGG 
TACTTCCCACATTGTATMGAAATTTGGCMGT TTAGAG CAATGTTTGAA 
GTGTTGGGAMTTTCTGTATACTCAAGAGGGCGTTTTTGACAACTGTAGA 
11801 ACAGAGGAATCAAAAGGGGGTGGGAGGAAGTTAAAAGMGAGGCAGGTGC 
AAGAGAGCTTGCAGTCCCGCTGTGTGTACGACACTGGCAACATGAGGTCT 
TTGCTAATCTTGGTGCTTTGCTTCCTGCCCCTGGCTGCCTTAGGGTGCGA 
11951 TCTGCCTCAGACCCACAGCCTGGGCAGCAGGAGGACCCTGATGCTGCTGG 
CTCAGATGAGGAGAATCAGCCTGTTTAGCTGCCTGAAGGATAGGCACGAT 
TTTGGCTTTCCTCMGAGGAGTTTGGCAACCAGTTTCAGAAGGCTGAGAC 
12101 CATCCCTGTGCTGCACGAGATGATCCAGCAGATCTTTAACCTGTTTAGCA 
CCAAGGATAGCAGCGCTGCTTGGGATGAGACCCTGCTGGATAAGTTTTAC 
ACCGAGCTGTACCAGCAGCTGAACGATCTGGAGGCTTGCGTGATCCAGGG 
12251 CGTGGGCGTGACCGAGACCCCTCTGATGAAGGAGGATAGCATCCTGGCTG 
TGAGGAAGTACTTTCAGAGGATCACCCTGTACCTGAAGGAGAAGAAGTAC 
AGCCCCTGCGCTTGGGAAGTCGTGAGGGCTGAGATCATGAGGAGCTTTAG 
1240 1 CCTGAGCACC AACCTGCMGAGAGCTTGAGGTCTAAGGAGTAAAAAGTCT 
AGAGTCGGGGCGGCGCGTGGTAGGTGGCGGGGGGTTCCCAGGAGAGCCCC 
CAGCGCGGACGGCAGCGCCGTCACTCACCGCTCCGTCTCCCTCCGCCCAG 
12551 GGTCGCCTGGCGCAACCGCTGCAAGGGCACCGACGTCCAGGCGTGGATCA 
GAGGCTGCCGGCTGTGAGGAGCTGCCGCGCCCGGCCCGCCCGCTGCACAG 
CCGGCCGCTTTGCGAGCGCGACGCTACCCGCTTGGCAGTTTTAAACGCAT 
1270 1 CCCTCATTAAAACGACTATACGCAAACGCCTTCCCGTCGGTCCGCGTCTC 
TTTCCGCCGCCAGGGCGACACTCGCGGGGAGGGCGGGAAGGGGGCCGGGC 
GGGAGCCCGCGGCCAACCGTCGCCCCGTGACGGCACCGCCCCGCCCCCGT 
12851 GACGCGGTGCGGGCGCCGGGGCCGTGGGGCTGAGCGCTGCGGCGGGGCCG 
GGCCGGGCCGGGGCGGGAGCTGAGCGCGGCGCGGCTGCGGGCGGCGCCCC 
CTCCGGTGCAATATGTTCAAGAGAATGGCTGAGTTCGGGCCTGACTCCGG 
13001 GGGCAGGGTGMGGTGCGGCGCGGGCGGAGGGACGGGGCGGGCGCGGGGC 
CGCCCGGCGGGTGCCGGGGCCTCTGCCGGCCCGCCCGGCTCGGGCTGCTG 
CG(3CGC™CGG^GCGCTTCTCGCCGCTGCCGCTTCTCTTCTCTCCCGC 
13151 GCAAGGGCGTCACCATCGTGAAGCCGGTAGTGTACGGGAACGTGGCGCGG 
TACTTCGGGAAGAAGAGGGAGGAGGACGGGCACACGCATCAGTGGACGGT 
TTACGTGAAGCCCTACAGGAACGAGGTAGGGCCCGAGCGCGTCGGCCGCC 
13301 GTTCTCGGAGCGCCGGAGCCGTCAGCGCGGCGCCTGGGTGCGCTGTGGGA 
CACAGCGAGCTTCTCTCGTAGGACATGTCCGCCTACGTGAAAAAAATCCA 
GTTCAAGCTGCACGAGAGCTACGGGAATCCTCTCCGAGGTGGGTGTTGCG 
13451 TCGGGGGGTnGCTCCGCTCGGfTCCCGCTGAGGCTCGTCGCCC^^ 

TCTrrCGTGCCGCAGTCGTTACCAAACCGCCGTACGAGATCACCGAAACG 
13551 GGCTGGGGCGMTTTGAMTCATCATCMGATATTT1TCATTGATOAM 
CGAGCGACCCGTAAGTACGCTCAGCTTCTCGTAGTGCTTCCCCCGTCCTG 
GCGGCCCGGGGCTGGGCTGCTCGCTGCTGCCGGTCACAGTCCCGCCAGCC 
13701 GCGGAGCTGACTGAGCTCCCTTTCCCGGGACGTGTGCTCTGTGTTCGGTC 
AGCGAGGCTATCGGGAGGGCTTTGGCTGCATTTGGCTTCTCTGGCGCTTA 
GCGCAGGAGCACGTTGTGCTACGCCTGAACTACAGCTGTGAGAAGGCCGT 
13851 GGAAACCGCTCTCAAACTGA7TTATTGGCGAAATGGCTCTAMCTAAATC 
GTCTCCTCTCTTTGGAAATGCTTTAGAGAAGGTCTCTGTGGTAGTTCTTA 
TGCATCTATCCTAMGCACTTGGCCAGACMTTTAAAGACATCAAGCAGC 
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14001 ATTTATAGCAG6CACGTTTAATAACGAATACTGAATTTAA6TAACTCTGC 
TCACGTTGTATGACGTTTATTTTCGTATTCCTGAAAGCCATTAAAATCCT 
GTGCAGTTGTTTAGTAAGAACAGCTGCCACTGTTTTGTATCTAGGAGATA 
14151 ACTGGTGTTTCCCTACAGTTCTCAAGCTGATAAAACTCTGTCTTTGTATC 
TAGGTAACCCTGTATCACTTGCTGAAGCTTnTCAGTCTGACACCAATGC 
AATCCTGGGAAAGAAAACTGTAGTTTCTGAATTCTATGATGAAATGGTAT 
14301 GAAMTTTTMTGTCMCCGAGCCTGACTTTATTTAAAAAAAATTATTGA 
TGGTGCTGTGTATTTTGGTCCTTCCTTAGATATTTCAAGATCCTACTGCC 
ATGATGCAGCAACTGCTAACGACGTCCCGTCAGCTGACACTTGGTGCTTA 

14451 CAAGCATGAAACAGAGTGTAAGTGCAAAATGAGGATACCTTCGCCGACCG 
TCATTCACTACTAATGTTTTCTGTGGGATGTGATCGTACAGTGAGTTTGG 
CTGTGTGAAATTTGAATAGCTTGGTATTGGCAGTGATGACGTGATCGATG 

14601 CCTTGCTTATCATGTTTGAAATGAAGTAGAATAAATGCAGCCTGCTTTAT 
TTGAGATAGTTTGGTTCATTTTATGGAATGCAAGCAAAGATTATACTTCC 
TCACTGAATTGCACTGTCCAAAGGTGTGAAATGTGTGGGGATCTGGAGGA 

14751 CCGTGACCGAGGGACATTGGATCGCTATCTCCCATTTCTTTTGCTGTTAC 
CAGTTCAGATTTTCTTTTCACCTAGTCTTTMTTCCCAGGGTTTTGTTTT 
TTCCTTGGTCATAGTTTTTGTTTTTCACTCTGGCAAATGATGTTGTGAAT 

14901 TACACTGCTTCAGCCACAAMCTGATGGACTGMTGAGGTCATCAAACAA 
ACTmcmmCGTAmCCTTTTTmCCCCCACmTCATTnTAC 
TGCTGTTGTTGAGTCTGTMGGCTAAMGTMCTGTTTTGTGCTTTTTCA 

15051 GGACGTGTGCTTTCCAAATTACTGCCA CATATAT AAAGAAAGGTTGGAAT 
TTTAMGATMTTCATGTTTCTTCTTCTTTTTTGCCACCACAGTTGCAGA 
TCTTGAAGTAAAAACCAGGGAAAAGCTGGAAGCTGCCAAAAAGAAAACCA 

15201 GTTTTGAAATTGCTGAGCTTAAAGAAAGGTTAAAAGCAAGTCGTGAAACC 
ATCAACTGCTTAAAGAGTGAAATCAGAAAACTCGAAGAGGATGATCAGTC 
TAAAGATATGTGATGAGTGTTGACTTGGCAGGGAGCCTATAATGAGAATG 

15351 AMGGACTTCAGTCGTGGAGTTGTATGCGTTCTCTCCAATTCTGTAACGG 
AGACTGTATGMTTTCATTTGCAAATCACTGCAGTGTGTGACAACTGACT 
TTTTATAAATGGCAGAAAACAAGAATGAATGT ATCCTC ATTTTATAGTTA 

15501 AMTCTATGGGTATCTACTGGTnATTTCMGGAGMTGGATCGT 

CTTGGAGGCCAGAnGCTGCTTGTATTGACTGCATTTGAGTGGTGTAGGA 
ACATTTTGTCTATGGTCCCGTGTTAGTTTACAGAATGCCACTGTTCACTG 

15651 TTTTtTTTrTGTATTTTACTTTTTCT^ 

■ TGAAMTAAMCATGCAGGrnTTTTTTAAATA 1 1 1 1 1 1 1 GTCTCTj\TCCA 

GTTTGGGCTTCMGTAnATTGTTAACAGCAAGrrCCTGATTTAAGTCAGA 
15801 GGCTGMGTGTMTGGTATTCMGATGCTTMGTCTGTTGTCAGCAAMC 

AAMGAGAAMCTTCATAAMTCAGGMGTTGGCATTTCTMTMCTTCT 

mTCMCAGATMGAGmCTAGCCCTGCATCTACmCACTTATGTAG 
15951 TTGATGCCTTTATATTTTGTGTGTTTGGATGCAGGMGTGATTCCTACTC 

TGTOTCTAGATAnCTATTTAACACTTGTACTCTGCTGTGCTTAGCCTT 
16051 TCCCCATGAAMTrCAGCGGCTGTAMTCCCCCTCTTCTTTTCTAQ^TC 

aWagatggcagaccctcaggcttatamggcttgggcatcttctttac 

TGCTTTGAGATTCTGTGTTGCAGTAACCTCTGCCAGAGAGGAGAAAAGCC 
16201 CCACAMCCTCATCCCCTTCTTCTATAGCMTCAGTATTACTMTGCTTT 
GAGMCAGAGCACTGGTTTGAMCGTTTGATMTTAGCATTTAACATGGC 
TTGGTAAAGATGCAGAACTGAAACAGCTGTGACAGTATGAACTCAGTATG 

FIG.6G 
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16351 GAGACTTCATTMGACAM 

GAGGACGGGGCMCTTGCACCACTGGTGCCCACACAMTCC^^TGGCG 

CTGCAGACCMTTTTTCTGGCATTCTGACTGTCGTTCCTGCTG 

16501 AGAGCMCTATTTTTATCAGCCACAGGCMTTTGCTTO 

' AGTGTTGTAGGTMGTATAMTGCATCGGCTCCAGAGCACTTTGAGTATA 
ClTATTAAAMCATAMTGAMGACAAATTAGCTnGCnGGGTGCACA^ 

16651 MCATTTTTAGTTCCAGCCTGCTTTnGGTAGMGCCCTC 

AGMCTGACmGACMGTAGAGAMCTGGCAACGGAGCTATOTATCG 

16751 MGGATCCnGTTMCAMGTTAATCGTCTmMG^ 

TTAMTTTGCTTTTMGCTGTAGCTGAAAMGMCGTGCTGTCTTCCATG 

16851 CACCAGGTGGCAGCTCTGTGCAMGTGCTCTC^ 

ctctSgcctggtctggtggttgg^ 
1 71 51 ttgaaactcgatgatcactgtggtccttttcaacccaggctattctatga 

TTC^TGATTCAACAGCAMTCATATGTACTGAGAGAGGAMCAAACACA 
A^TArTfimr^MGTmGmA TTTGGTAAAAGAGT CAGG^TTO 

17301 AMnCAAMTCTGTCTGGTTTOTGIIIIlim^ 

TCTTTGGGGTTCTTTTTGATGCTTTATCTTTCTCTGCCAGGACTGTGTGA 

CMTGGGMCGAAAMGMCATGCCAGGCACTGTCCTGGATTGCACACGC 

17451 TGGnGCACTCAGTAGCAGl^ 

rTTTrTAAACCTAATTTTAATAGCGTTAGTAGACTTCCATCACTGGGCAG 

TGCTTACTG^TGCTCTGWGMCGTTTTA 
17601 TT^GATGTTCCTGGATGWGTAGGGMGGACAGATTAGCTATGTGAAAA 

17601 SJ3S 

CTTGTTGTMCTMTTGGMTnCTTTMGCGTTCACTTATGCTACATTC 

17751 ATAGTATTTCCATTTAAAAGTAGGAAAAGGTAAAACCTGAM 

TTnCGGATGGMCACCGCCTTCCTATGCACCTGACCAACTTCCAGAGGA 

AA^CCWTGAMGCCGAGATTMGCCACCAAAAGAA^ 

17901 GGMTATGTAGTATnGCCCTCTTCCTCCCGG^^ 

AGGGTGKTrATATGTTAMTGAGTG(£T^ 
GTGGGGMnCTGTCCTCTAGGACAGAMCMTTTTAATCTGTTCCACTG 

18051 GTGACTGCTtTGTCAGCACTTCCACCTGAAGA 
TGTCTAGTTCTGCMCACTrcGO^ 

TTCATGCCTATGCTTATTAMGCMTMTCTGGGn^MTTTTTGTTTTAAT 
lft?ni ^CTffrcCTGACCCCAGTGATGACCGTGTCCCACCTAMGCTCMTTGAG 

TTA^TCTGAG^^ 

18351 ^^^^^U^l^^g^ 
GGCCAGCTGGTCGACCTGCAGATCO^CCTCG^^ 
rA(^TTTTGnCCCTnAGTGAGGGTTMTTTCGAGCTTGGCGTAATCAT 

R^^AGCTGT^CCTGTGTGAMTTGTTATCCGCTCACMTTCCACAC 

RARrTAACTCACATTAATTGCGTTGCGCTCACTGCCCGCTTTCCAGTCGG 
18651 gSK^^ 

FIG.6H 
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GGCGGTTTGCGTATTGGGCGCTCTTCCGCTTCCTCGCTCACTGACTCGCT 
GCGCTCGGTCGTTCGGCTGCGGCGAGCGGTATCAGCTCACTCAAAGGCGG 
- """" TAATACGGTTATCCACAGAATCAGGGGATAACGCAGGAAAGAACATGTGA 
GCAAAAGGCCAGCAAAAGGCCAGGAACCGTAAAAAGGCCGCGTTGCTGGC 
GTTTTTCCATAGGCTCCGCCCCCCTGACGAGCATCACAAAAATCGACGCT 
CAAGTCAGAGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTT 
CCCCCT(3GMGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTAC 
CGGATACCTGTCCGCCTTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCATA 
19101 GCTCACGCTGTAGGTATCTCAGTTCGGTGTAGGTCGTTCGCTCCAAGCTG 
S^nricr nrr.Mrrcrrc RTTf.ARCCCGACCGCTGCGCCTTATCCGG 

19251 
19401 



TAATACGGTTATCCACAGAATCAGGGGATAACGCAGGAAAGAACATGTGA 
GCAAAAGGCCAGCAAAAGGCCAGGAACCGTAAAAAGGCCGCGTTGCTGGC 
GTTTTTCCATAGGCTCCGCCCCCCTGACGAGCATCACAAAAATCGACGCT 
18951 CAAGTCAGAGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTT 

CGGATACCTGTCCGCCTTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCATA 
GCTCACGCTGTAGGTATCTCAGTTCGGTGTAGGTCGTTCGCTCCAAGCTG 
GGCTGTGTGCACGAACCCCCCGTTCAGCCCGACCGCTGCGCCTTATCCGG 
TAACTATCGTCTTGAGTCCAACCCGGTAAGACACGACTTATCGCCACTGG 
1 9251 CAGCAGCCACTGGTAACAGGATTAGCAGAGCGAGGTATGTAGGCGGTGCT 
ACAGAG^C^GAAGTGGTGGCCTAACTACGGCTACACTAGAAGGACAGT 
ATTTGGTATCTGCGCTCTGCTGAAGCCAGTTACCTTCGGAAAAAGAGTTG 

GTAGCTCTTGATCCGGCAAACAAACCACCGCTGGTAGCGGTGG 1 1 1 1 1 1 1 
GTTTGCMGCAGCAGATTACGCGCAGAAAAAAAGGATCTCAAGAAGATCC 
TTTGATCTTTTCTACGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTr 

19551 AAGGGATTTTGGTC ATGAGATTATC AAAAAGGATCTTC AC CTAGATC CTT 
^AM^AAAAATGMGTTTTAAATCAATCTAAAGTATATATGAGTAAAC 
TTGGTCTGACAGTTACCMTGCTTAATCAGTGAGGCACCTATCTCAGCGA 
19701 TCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGTGTAGATA 
ACTACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATACC 
Aorr ArcrTrArrccrTrrAfiATTTATCAGCAATAAACCAGCCAG 



GCGAGACCCACGCTCACCGGCTCCAGATTTATCAGCMTAMCCAGCCAG 
19851 CCGGMGGGCCGAGCGCAGAAGTGGTCCTGCMCTTTATCCGCCTCCATC 
CAffrCTATTAATTGTTGCCGGGAAGCTAGAGTAAGTAGTTCGCCAGTTAA 

TAGT^GCGCMCGTTGTTGCCATTGCTACAGGCATCGTGGTCT 
CGTCGTTTGGfTATGGCTTCATTCAGCTCCGGTTCCCMCGATCAAGGCGA 

OTACATGATCCCCCATGTTGTO^ 
TCCGATCGTTGTCAGiMGTMGTTGGCCGCAGT^^ 
TGGCAGCACTGCATMTTCTCTTACTGTCAT^CATCCGTA 

TCTGTGACTGGTGAGTACTCAACCAAGTCATTCTGAGAATAGTGTATGCG 
»^.^rA ^-TT/vTrTTcrrrcfirfiTr A ATAf GGGATAATACCGCGCCAC 



TCTGTGACTGGTGAGTAC I CAALLAAla I LA ll L I uHUnn I m , amiu^ 

GGTGAGCAAAMCAGGMGGCAAMTGCCGCAAAAMGGGAATAAGG^G 
ACACGGAMTGTTGMTACTCATACTCTTCCITTTTCMTAT^ 

CATnATCAGGGTTATTCTCTCATGAGCG 
AGAAAMTAMCAMTAGGGGTTCCGCGCACATTTCCCCGAAAAG^CA 

CCTAAA^GTMGCGTTMTAT^ 

TAMTCAGCTCATTTTTTMCCMTAGGCCGAW 

TAAATCAAMGMTAGACCGAGATAGGGTTGAGTGTTGTTCCAGTnGGA 

ACMGAGTCCACTATTAMGAACGTGGACTCCMCGTCAMGG^GAAAA 

ACCGTCTATCAGGGCGATGGCCCACTACGTGMCCATCACCCTMTCAAG 

TTTTTTGGGGTCGAGGTGCCGTAMGCAGTAMTCGGMCCCTAAAGGGA 

GCCCCCGATTTAGAGCTTGACGGGGAMGCCGGCGMCGTGGCGAGAAAG 

GAAGGGAAGAAAGCGAAAGGAGCGGGCGCTAGGGCGCTGGCAAGTGTAGC 
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21051 GGTCACGCTGCGCGTAACCACCACACCCGCCGC6CTTAATGCGCC6CTAC 
AGGGCGCGTCCCATTCGCCATTCAGGCTGCGCAACTGTTGGGMGGGCGA 
TCGGTGCGGGCCTCTTCGCTATTACGCCAGCTGGCGAAAGGGGGATGTGC 

21201 TGCAAGGCGATTAAGTTGGGTAACGCCAGGGTTTTCCCAGTCACGACGTT 
GTAAAACGACGGCCAGTGAATTGTAATACGACTCACTATAGGGCGAATTG 

21301 GAGCTCCACCGCGGTGGCGGCCGCTCTAG 
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SEQ ID NO: 11 

GTACCGGGCCCCCCCTCGAG6TGAATATCCAAGAAT6CAGAACTGCAT6GAAAGCAGA6CTG 

CAGGCACGATGGTG^ 
GMGGACCT^ 

AAGAGG I I I 1 1 1 I I I GGTCCAAMGTCTGTTTGTTTGGTTTTGACCACTGAGAGCAT 
GTGACACTTGtCTCMGCTATTM 

TTnCCATnGMGCCCCTTGCAAAAACAAAGAGCACCTTGCCTGCTCCAGCTCCTGGCTGT 
GMGGGTTITGCTGC^ 



GCAAC^GGGGGCTATTGAGAMCMGGMGGACAMCAGCGTTAGGTCATTGCTTCTGCM 
ACACAG^ 

CAGAGGGGACGGTCTGTGAATCAAGCTT 

FIG. 14 



SUBSTITUTE SHEET (RULE 26) 



WO 03/025146 



34/43 



PCT/US02/29878 



SEQ ID NO: 17 
I FN -A 

ATGGCTTTGACCTTT6CCTTACT6GTGGCTCTCCTGGT6CTGAGCT6CAAGAGCA6CTGCTCTGT 
GGGCTGCGATCTGCCTCA 

SEQ ID NO: 18 
IFN-B 

GACCCACAGCCTGGGCAGCAGGAGGACCCTGATGCTGCTGGCTCAGATGAGGAGAATCAGCCTGT 
TTAGCTGCCTGMGGATAGGCACGATTTTGGCTTT 

SEQ ID NO: 19 
IFN-C 

CTCMGAGGAGTTTGGCAACCAGTTTCAGAAGGCTGAGACCATCCCTGTGCTGCACGAGATG 

SEQ ID NO: 20 
IFN-D 

TCCAGCAGATCTTTMCCTGTTTAGCACCAAGGATAGCAGCGCTGCTTGGGATGAGACCCTGCTG 
GATAAGTTTTACACCGAGCTGTACCAGCA 

SEQ ID NO: 21 
IFN-E 

CTGAACGATCTGGAGGCTTGCGTGATCCAGGGCGTGGGCGTGACCGAGACCCCTCTGATGAAGGA 
GGATAGCATCCT 

SEQ ID NO: 22 
IFN-F 

GCTGTGAGGAAGTACTTTCAGAGGATCACCCTGTACCTGAAGGAGAAGAAGTACAGCCCTTGCGC 
TTGGGAAGTCGTGAGGG 

SEQ ID NO: 23 

IFN-G " 

CTGAGATCATGAGGAGCmAGCCTGAGCACCMCCTGCMGAGAGCTTGAGGTCTAAGGAGTAA 

SEQ ID NO: 24 
IFN-1 

CCCMGCTTTCACCATGGCTTTGACCTTTGCCTT 

SEQ ID NO: 25 
IFN-2b 

ATCTGCCTCAGACCCACAG 

FIG.15A 
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SEQ ID NO: 26 
IFN-3c 

GATTTTGGCTTTCCTCAAGAG6AGTT 

SEQ ID NO: 27 
IFN-4b 

GCACGAGATGATCCAGCAGAT 

SEQ ID NO: 28 
IFN-5 

ATCGTTCAGCTGCTGGTACA 

SEQ ID NO: 29 
IFN-6 

CCTCACAGCCAGGATGCTAT 

SEQ ID NO: 30 
IFN-7 

ATGATCTCAGCCCTCACGAC 

SEQ ID NO: 31 
IFN-2 

CTGTGGGTCTGAGGCAGAT 

SEQ ID NO: 32 
IFN-3b 

AACTCCTCTTGAGGAAAGCCAAAATC 

SEQ ID NO: 33 
IFN-4 

ATCTGCTGGATCATCTCGTGC 

SEQ ID NO: 34 
IFN-8 

TGCTCTAGACTTTTTACTCCTTAGACCTCAAGCTCT 
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Bird Number 



FIG.17 



WO 03/025146 



PCT/US02/29878 



38/43 



CO 
• ■ ■ 




SUBSTITUTE SHEET (RULE 26) 



WO 03/025146 



PCTAJS02/29878 




WO 03/025146 



PCT/US02/29878 



in /at 




SUBSTITUTE SHEET (RULE 26) 



WO 03/025146 PCT/US02/29878 



250 



<r> 200 



CNI 
I 

o 



150 



2 100 



50 



p/MJCR- 
A1 15.93.1.2 



^ISeries 1 
ISeries 2 




I 



pCMV- 0 12.5 25 50 100 200 
EGFP pg/ ml pg/ml pg/ml pg/ ml pg/ml pg/ml 



FIG.21 



SUBSTITUTE SHEET (RULE 26) 



PCTAJS02/29878 

■ 

42/43 



SEQ ID NO: 38 

Oligo 1. TCACTCGA6GTGAATATCCAAGAAT 
SEQ ID NO: 39 

Oligo 2. GAGATCGATTTTGGCTGGACACTTG 
SEQ ID NO: 40 

Oligo 3. CACATCGATGTCACAACTTGGGAAT 
SEQ ID NO: 41 

Oligo 4. TCTAAGCTTCGTCACAGACCGTCCC 

FIG. 22 
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SEQUENCE LISTING 

<110> Avigenics, Inc. 

<120> Production of a Transgenic Avian by 
Cytoplasmic Injection 

<130> 11106-014-228 

<140> To be assigned 
<141> 2002-09-18 

<150> 60/351,550 
<151> 2002-01-25 

<150> 60/322,969 
<151> 2001-09-18 

<160> 41 

<170> Patentln version 3.1 

<210> 1 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 5pLMAR2 

<400> 1 

tgccgccttc tttgatattc 

<210> 2 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer LE-6.1kbrevl 

<400> 2 

ttggtggtaa ggcctttttg 



<210> 3 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer lys-6.1 

<400> 3 

ctggcaagct gtcaaaaaca 



<210> 4 

<211> 20 

<212> DNA 

<213> Artificial sequence 



20 



20 



20 



-1- 
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<220> 

<223> Primer LysSlrev 



<400> 4 

cagctcacat cgtccaaaga 



<210> 5 

<211> 498 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> IFNMAGMAX 
<220> 

<221> misc_feature 

<222> (1)..(498) 
<223> 



20 



4 98 



<4O0> 5 • *. 

tgcgatctgc ctcagaccca cagcctgggc agcaggagga ccctgatgct gctggctcag bu 

atgaggagaa tcagcctgtt tagctgcctg aaggataggc acgattttgg ctttcctcaa 120 

gaggagtttg gcaaccagtt tcagaaggct gagaccatcc ctgtgctgca cgagatgatc 180 

cagcagatct ttaacctgtt tagcaccaag gatagcagcg ctgcttggga tgagaccctg 240 

ctggataagt tttacaccga gctgtaccag cagctgaacg atctggaggc ttgcgtgatc 300 

cagggcgtgg gcgtgaccga gacccctctg atgaaggagg atagcatcct ggctgtgagg 360 

aagtactttc agaggatcac cctgtacctg aaggagaaga agtacagccc ctgcgcttgg 420 

gaagtcgtga gggctgagat catgaggagc tttagcctga gcaccaacct gcaagagagc 480 
ttgaggtcta aggagtaa 

<210> 6 

<211> 12728 

<212> DNA 

<213> Gallus gallus 

<220> 

<221> mis cofeature 

<222> (1)..(237) 

<223> 5prime matrix (scaffold) attachment region (MAR) 
<220> 

<221> misc_feature 

<222> (261) (1564) . 

<223> Sprime matrix (scaffold) attachment region (MAR) 

<220> 

<221> misc_feature 

<222> (1565) .. (1912) 

<223> Sprime matrix (scaffold) attachment region (MAR) 
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<220> 
<221> 
<222> 
<223> 



misc_f eature 

(1930) (2012) , / mb T3 \ 

Sprime matrix (scaffold) attachment region (MAR) 



<220> 
<221> 
<222> 
<223> 



misc_feature 
(2013) . . (2671) 
Intrinsically curved DNA 



<220> 
<221> 
<222> 
<223> 



misc_f eature 
(5848) . . (5934) 
Transcription Enhancer 



<220> 
<221> 
<222> 
<223> 



misc_f eature 
(9160) . . (9325) 
. Transcription Enhancer 



<220> 
<221> 
<222> 
<223> 



misc_f eature 
(9326) . . (9626) 

Negative Regulatory Element 



<220> 
<221> 
<222> 
<223> 



misc_f eature 
(9621) . . (9660) 
Hormone Response 



Element 



<220> 
<221> 
<222> 
<223> 



mi sc_f eature 
(9680) . . (10060) 
Hormone Response 



Element 



<220> 
<221> 
<222> 
<223> 



misc_f eature 
(10576) . . (10821) 
Chicken CR1 Repeat Sequence 



<220> 
<221> 
<222> 
<223> 



misc_f eature 
(10926) (11193) 
Chicken CR1 Repeat 



Sequence 



<220> 
<221> 
<222> 
<223> 



mi sc_f eature 

(11424) (11938) _ _ n ... 

Lysozyme Proximal Promoter and Lysozyme Signal Peptide 



<220> 

<221> misc feature 
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<222> (11946) .. (12443) . , . . . - r ovn 

<223> Human Interferon alpha 2d encoding region codon optimized for exp 

ression in chicken cells (IFNMAGMAX) 



<220> 

<221> polyA_signal 

<222> (12444) . . (12728) 
<223> 



tgScgccttc tttgatattc actctgttgt atttcatctc ttcttgccga tgaaaggata 
taacagtctg tataacagtc tgtgaggaaa tacttggtat ttcttctgat cagtgttttt 
ataagtaatg ttgaatattg gataaggctg tgtgtccttt gtcttgggag acaaagccca 
cagcaggtgg tggttggggt ggtggcagct cagtgacagg agaggttttt ttgcctgttt 
tttttttttt tttttttttt aagtaaggtg ttcttttttc ttagtaaatt ttctactgga 
ctgtatgttt tgacaggtca gaaacatttc ttcaaaagaa gaaccttttg gaaactgtac 
agcccttttc tttcattccc tttttgcttt ctgtgccaat gcctttggtt ctgattgcat 
tatggaaaac gttgatcgga acttgaggtt tttatttata gtgtggcttg aaagcttgga 
tagctgttgt tacacgagat accttattaa gtttaggcca gcttgatgct ttattttttc 
cctttgaagt agtgagcgtt ctctggtttt tttcctttga aactggtgag gcttagattt 
ttctaatggg attttttacc tgatgatcta gttgcatacc caaatgcttg taaatgtttt 
cctagttaac atgttgataa cttcggattt acatgttgta tatacttgtc atctgtgttt 
ctagtaaaaa tatatggcat ttatagaaat acgtaattcc tgatttcctt tttttttatc 
tctatgctct gtgtgtacag gtcaaacaga cttcactcct atttttattt atagaatttt 
atatgcagtc tgtcgttggt tcttgtgttg taaggataca gccttaaatt tcctagagcg 
atgctcagta aggcgggttg tcacatgggt tcaaatgtaa aacgggcacg tttggctgct 
gccttcccga gatccaggac actaaactgc ttctgcactg aggtataaat cgcttcagat 
cccagggaag tgcagatcca cgtgcatatt cttaaagaag aatgaatact ttctaaaata 
ttttggcata ggaagcaagc tgcatggatt tgtttgggac ttaaattatt ttggtaacgg 
agtgcatagg ttttaaacac agttgcagca tgctaacgag tcacagcgtt tatgcagaag 
tgatgcctgg atgcctgttg cagctgttta cggcactgcc ttgcagtgag cattgcagat 
aggggtgggg tgctttgtgt cgtgttccca cacgctgcca cacagccacc tcccggaaca 
catctcacct gctgggtact tttcaaacca tcttagcagt agtagatgag ttactatgaa 
acagagaagt tcctcagttg gatattctca tgggatgtct tttttcccat gttgggcaaa 
gtatgataaa gcatctctat ttgtaaatta tgcacttgtt agttcctgaa tcctttctat 1500 
agcaccactt attgcagcag gtgtaggctc tggtgtggcc tgtgtctgtg cttcaatctt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
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ttaaagcttc tttggaaata cactgacttg attgaagtct cttgaagata gtaaacagta 

aaagaattcc gcctattcat 
ggaatatatt caagtaatag 



cttacctttg atcccaatga aatcgagcat ttcagttgta 
accatgtaat gtaattttac acccccagtg ctgacacttt 



tttgtaatag aaaatatttt aaactgtgca 

tatgattatt acattatgaa agagacattc tgctgatctt caaatgtaag aaaatgagga 

gcaaattagt gcaggtgtcc ttaaaaaaaa 
gttttacaag tgaaatacat tcctatttgg 
taaacagtta catttttatg aagattacca gcgctgctga ctttctaaac ataaggctgt 

ttcccaattt gcacaaggat gtctgggtaa 
atgggagctt gtctgagttg gaatgcagag 
ctctcagaat gcccaactcc aaaggatttt 



actttggcct caccctcttg tgtactgtat 



gtgcgtgtgc ttttataaat acaagtgatt 
aaaaaaaaag taatataaaa aggaccaggt 



attgtcttcc tgtaccattg catttcctca 
actattcaag aaatggcttt gaaatacagc 



caaagagtct gctgaatgtt 
tatcctagca actgcggatt 



ttgcactgca aaatgtcagg aaatggatgt 
atatgtgtat atagtaagca gtttcctgat tccagcaggc 
gtgttgccgg agacctgtat ttctcaacaa ggtaagatgg 
ttaatacatt ttcagcagaa gtacttagtt aatctctacc tttagggatc gtttcatcat 
ttttagatgt tatacttgaa atactgcata acttttagct ttcatgggtt cctttttttc 
agcctttagg agactgttaa gcaatttgct gtccaacttt tgtgttggtc ttaaactgca 
atagtagttt accttgtatt gaagaaataa agaccatttt tatattaaaa aatacttttg 
tctgtcttca ttttgacttg tctgatatcc ttgcagtgcc cattatgtca gttctgtcag 



atattcagac atcaaaactt 
gttattattt ctgaaactag 
agagtgtaag gctagtgaga 
tatcagattt ttttttcatt 
tgaaatgcag tctgattggc 
tttggaatga aggaagttaa 
gagaaagtga acctggattt 
cccgattcct tgaaagggct 
ggccactggt tatttactgc 
ttgagcatgg actatagcct 
gtgctgggct gtggctgggg 
cagggaaaag tgtgggtaac 



aacgtgagct 

aaatgatgtt 

aatgcataca 

tggaaatata 

atgaagaagc 

gcaagggcac 

ctttggctag 

ccagctttaa 

attatgtctc 

ggcttcagag 

ggactgtggg 

tatttttaag 



cagtggagtt 

gt.cttcatct 

tttattgata 

ttgttttcta 

acagcactct 

aggtccatga 

tgttctaaat 

tgcttccaaa 

agtttcgcag 

gccaggtgaa 

gactccaagc 

tactgtgttg 



acagctgcgg ttttgatgct 
gctcatcaaa cacttcatgc 
cttttttaaa gtcaactttt 
gactgcatag cttctgaatc 
tcatcttact taaacttcat 
aatagagaca gtgcgctcag 
ctgtagtgag gaaagtaaca 
ttgaaggtgg caggcaactt 
ctaacctggc ttctccacta 
ggttgggatg ggtggaagga 
tgagcttggg gtgggcagca 
caaacgtctc atctgcaaat 



acgtagggtg tgtactctcg aagattaaca gtgtgggttc agtaatatat ggatgaattc 
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acagtggaag cattcaaggg tagatcatct 
aagcggtatc agaagagcga ggaaggtaag 
gcagtctggg aaagtagcac cccttgagca 
taggagaact ttcttgctga attctacttg 
ttctgcagca cctgcaaggc ccagagcctg 
gtccaagctt cagcaggtca ttgtctttgc 



aacgacacca gatcatcaag 
cagtcttcat atgttttccc 
gagacaagga aataattcag 
caagagcttt gatgcctggc 
tggtgagctg gagggaaaga 
ttcttccccc agcactgtgc 



ctatgattgg 
tccacgtaaa 
gagcatgtgc 
ttctggtgcc 
ttctgctcaa 
agcagagtgg 



aactgatgtc gaagcctcct gtccactacc tgttgctgca ggcagactgc tctcagaaaa 
agagagctaa ctctatgcca tagtctgaag gtaaaatggg ttttaaaaaa 
aggcaaaacc ggctgcccca tgagaagaaa gcagtggtaa acatggtaga 

tgccacctag aggcgggaac 



aagcccccag gcagtgtgac aggcccctcc 
gcctagggct ctgcccgcga agtgcgtgtt 
ttgagattta gacacaaggg aagcctgaaa 
agcctgtact tcaaatatat attttgtgag 
aagttgcaag agattgaagg ctgagtagtt 
gaaactactg cttctaaaca cttgtttgag 
gttacatgtc tgatgcactt gcttgtcctt 



tctttggtgg gttttgtttg 
ggaggtgttg ggcactattt 
ggagtgtagc gaattggcca 
gagagggtaa cacgtttaat 
tggtgagacc ttggataggt 
ttccatccac atccatgcat 



gaaaacacaa 
aaaggtgcag 
aagcttccct 
gcgtttggtt 
tggtttgtaa 
atttaaaata 
gagatcttct 
gagtgctctt 
tccacatcca 



cgcatttgtc acttatccca 
tcagaagaaa cagatgtgat 
tctttccctt tttcctgcta 
tcttcctgcc ttacattctg 
tttgtgtctt cctactctta 
gttggccgca gttctctgat 



tctgaggaac 
ttgcaactga 
ttccttcttg 
gaaagtatgt 
cgggcaccat 



gggcagcgtt 
tggtggaact 
gcagtcagtt 
ggctctgcct 
tcatcccaaa 



tatctgtcat atctgacata cctgtctctt cgtcacttgg 

cgccccaagt ttgagaagat ggcagttgct 
ttctcct-ggc tttgacacct cacgaaatag 
caaatatctt tggagtgcgc tgctctcaag 
tcttagagtg aaagagaagg aagagaagat 
ctgaataatg gccaaaggtg ggtgggtttc 
agcaaggagc tctgcggagt tgcagttatt 
gcagattccc taggttccct gctacttctt 
agacaaacag ccacccccac tgcaggctta 
cagctctgcc ctggtgaaag gggattaaaa 
attcatggat caagctgtaa ggaacttggg 



aatccccagc 
agtaaggatt 
ggcattattt 
gagtgaatgc 
gaacacacct 
tgcctctgaa 
ggtgcttaaa 
tatttctgac 
gggtgtgtta 
caggatcctc 



ctccaacctc aaaacattaa ttggagtacg aatgtaatta aaactgcatt ctcgcattcc 
taagtcattt agtctggact ctgcagcatg taggtcggca gctcccactt tctcaaagac 
cactgatgga ggagtagtaa aaatggagac cgattcagaa caaccaacgg agtgttgccg 
aagaaactga tggaaataat gcatgaattg tgtggtggac atttttttta aatacataaa 
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agaccccttg ctgatttgtg tcaagctttt 
tacttctgcc ctcacttctg taagcttgtt 
gtgttacaec tacctgccct attcttgcat 
aggacagaat atgtattcct ctaacaaaaa 
agggttttca tggcagtatt agcacatagt 
gcctcccata aatcctgtat ttgggatcag 
agagaccctg ggggttctga tgtgcttcag 
agatcaccca gttgttcctg tacaacttcc 
ttgacaaata caggcctatt tttgtgtttg 



cccagggtac tgttggcctt tcaggctccc 
catccaccag aacccacgct tcctggttaa 
tcaggagact tccattcttt aggacagact 
atatacattt cagttcatgt ttcctgtaac 
tacatgcaga attcctagtg ccatctcagt 
caatttgctg caagtacctt ccaagctgcg 

ttaccttttg gggtaagctt ttgtatctgc 

ctctgctctg ttctgactgc accattttct 

ttgtcctcca tcctttccca gcttgtatct 

cttcagcagc catttaattc ttcagtgtca tcttgttctg ttgatgccac tggaacagga 

ttttcagcag tcttgcaaag aacatctagc 

cagttcttct tgtttgaggt gagccataaa 

gcattttatt acttctatta tgtacttact 

ctgggatttc cacagtgtct ctgtgtcctt 

aaccttggca atctgcccag ctgcccatca caagaaaaga gattcctttt ttattacttc 

tcttcagcca ataaacaaaa tgtgagaagc ccaaacaaga acttgtgggg caggctgcca 

tcaagggaga gacagctgaa gggttgtgta gctcaataga attaagaaat aataaagctg 



tgaaaacttt ctgccattca atattcttac 
ttactagaac ttcgtcactg acaagtttat 
ttgacataac acagacacgc acatattttg 
cacatggttt tactgtcata cttccgttat 



tgtcagacag ttttgcctga tttatacagg 
aggccacctt gcagtccttg gtttgtaaga 
cgtggagaat catgatggca gttcttgctg 
cagcaaagta acacttgctg ctgtaggtgc 
caccaagatg agggatgctc ccagctgacg 
ctgcctgctc attagcatca cctcagccct 
tgaggaaagt tgctcatctt cttcacatca 
gatgcttaaa tgtggtcact gacatcttta 
gatcaggagg gaacacatag tgggaatgta 
atgatcatgc atgctactta ggaaggtgtg 



cacgccccaa gccagagagg ctgtctgcca 
taagtcatag gtaacttttc tggtgaattg 
tttactatgg taagatgcta aaataggaga 
tctgctatcc agacagcgat ggcactcgca 
gatgctgggg cagtaacagt gggtcccatg 
caccagccca tcagaaggat catcccaagc 
tcaaaccttt ggcctgactg atgcctcccg 
tttttctatg atttcaagtc agaacctccg 
ccctcagctc caaggccaga tcttccttca 
tgtgtgtgaa tgtagaattg cctttgttat 



tttttcttcc tgctgtcagg aacattttga ataccagaga aaaagaaaag tgctcttctt 

taaaggatgc agtcccaaat gttcataatc 



ggcatgggag gagttgtcac acttgcaaaa 
tcagggtctg aaggaggatc agaaactgtg 
ttttgaaagc tgttcctggc cgaggcagta 



tatacaattt caggcttctc tgaatgcagc 
ctagtcagaa ccctcggaaa caggaacaaa 
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tgtcttcaag gtgcagcagg aggaaacacc ttgcccatca tgaaagtgaa taaccactgc 
cgctgaagga atccagctcc tgtttgagca ggtgctgcac actcccacac tgaaacaaca 
gttcattttt ataggacttc caggaaggat cttcttctta agcttcttaa ttatggtaca 
tctccagttg gcagatgact atgactactg acaggagaat gaggaactag ctgggaatat 
ttctgtttga ccaccatgga gtcacccatt tctttactgg tatttggaaa taataattct 
gaattgcaaa gcaggagtta gcgaagatct tcatttcttc catgttggtg acagcacagt 
tctggctatg aaagtctgct tacaaggaag aggataaaaa tcatagggat aataaatcta 
agtttgaaga caatgaggtt ttagctgcat ttgacatgaa gaaattgaga cctctactgg 9480 
atagctatgg tatttacgtg tctttttgct tagttactta ttgaccccag ctgaggtcaa 
gtatgaactc aggtctctcg ggctactggc atggattgat tacatacaac tgtaatttta 
gcagtgattt agggtttatg agtacttttg cagtaaatca tagggttagt aatgttaatc 
tcagggaaaa aaaaaaaaag ccaaccctga cagacatccc agctcaggtg gaaatcaagg 
atcacagctc agtgcggtcc cagagaacac agggactctt ctcttaggac ctttatgtac 
agggcctcaa gataactgat gttagtcaga agactttcca ttctggccac agttcagctg 
aggcaatcct ggaattttct ctccgctgca cagttccagt catcccagtt tgtacagttc 
tggcactttt tgggtcaggc cgtgatccaa ggagcagaag ttccagctat ggtcagggag 9960 
tgcctgaccg tcccaactca ctgcactcaa acaaaggcga aaccacaaga gtggcttttg 10020 
ttgaaattgc agtgtggccc agaggggctg caccagtact ggattgacca cgaggcaaca 10080 
ttaatcctca gcaagtgcaa tttgcagcca ttaaattgaa ctaactgata ctacaatgca 10140 
atcagtatca aoaagtggtt tggcttggaa gatggagtct aggggctcta caggagtagc 10200 
tactctctaa tggagttgca ttttgaagca ggacactgtg aaaagctggc ctcctaaaga 10260 
ggctgctaaa cattagggtc aattttccag tgcactttct gaagtgtctg cagttccoca 10320 
tgcaaagctg cccaaacata gcacttccaa ttgaatacaa ttatatgcag gcgtactgct 10380 
tcttgccagc actgtccttc tcaaatgaac tcaacaaaca atttcaaagt ctagtagaaa 104 40 
gtaacaagct ttgaatgtca ttaaaaagta tatctgcttt cagtagttca gcttatttat 10500 
gcccactaga aacatcttgt acaagctgaa cactggggct ccagattagt ggtaaaacct 10560 
actttataca atcatagaat catagaatgg cctgggttgg aagggaccco aaggatcatg 10620 
aagatccaac acccccgcca caggcagggc caccaacctc cagatctggt actagaccag 10680 
gcagcccagg gctccatcca acctggccat gaacacctcc agggatggag catccaoaao 10740 
ctctctgggc agcctgtgcc agcacctcac caccctctct gtgaagaact tttccctgac 10800 
atccaatcta agccttccct ccttgaggtt agatccactc ccccttgtgc tatcactgtc 10860 
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tactcttgta aaaagttgat tctcctcctt tttggaaggt tgcaatgagg tctccttgca 10920 
gccttcttct cttctgcagg atgaacaagc ccagctccct cagcctgtct ttataggaga 10980 
ggtgctccag ccctctgatc atctttgtgg ccctcctctg gacccgctcc aagagctcca 11040 
catctttcct gtactggggg ccccaggcct gaatgcagta ctccagatgg ggcctcaaaa 11100 
gagcagagta aagagggaca atcaccttcc tcaccctgct ggccagccct cttctgatgg 11160 
agccctggat acaactggct ttctgagctg caaottctcc ttatcagttc cactattaaa 11220 
acaggaaoaa tacaacaggt gctgatggcc agtgcagagt ttttcacact tcttcatttc 11280 
ggtagatctt agatgaggaa cgttgaagtt gtgcttctgc gtgtgcttct tcctcctcaa 11340 
atactcctgc ctgatacctc accccacctg ccactgaatg gctccatggc cccctgcagc 11400 
cagggccctg atgaacccgg cactgcttca gatgctgttt aatagcacag tatgaccaag 11460 
ttgcacctat gaatacacaa acaatgtgtt gcatccttca gcacttgaga agaagagcca 11520 
aatttgcatt gtcaggaaat ggtttagtaa ttctgccaat taaaacttgt ttatctacca 11580 
tggctgtttt tatggctgtt agtagtggta cactgatgat gaacaatggc tatgcagtaa 11640 
aatcaagact gtagatattg caacagacta taaaattcct ctgtggctta gccaatgtgg 11700 
tacttcccac attgtataag aaatttggca agtttagagc aatgtttgaa gtgttgggaa 117 60 
atttctgtat actcaagagg gcgtttttga caactgtaga acagaggaat caaaaggggg 11820 
tgggaggaag ttaaaagaag aggcaggtgc aagagagctt gcagtcccgc tgtgtgtacg 11880 
acactggcaa catgaggtct ttgctaatct tggtgctttg cttcctgccc ctggctgcct 11940 
tagggtgcga tctgcctcag acccacagcc tgggcagcag gaggaccctg atgctgctgg 12000 
ctcagatgag gagaatcagc ctgtttagct gcctgaagga taggcacgat tttggctttc 12060 
ctcaagagga gtttggcaac cagtttcaga aggctgagac catccctgtg ctgcacgaga 12120 
tgatccagca gatctttaac ctgtttagca ccaaggatag cagcgctgct tgggatgaga 12180 
ccctgctgga taagttttac accgagctgt accagcagct gaacgatctg gaggcttgcg 12240 
tgatccaggg cgtgggcgtg accgagaccc ctctgatgaa ggaggatagc atcctggctg 12300 
tgaggaagta ctttcagagg atcaccctgt acctgaagga gaagaagtac agcccctgcg 12360 
cttgggaagt cgtgagggct gagatcatga ggagctttag cctgagcacc aacctgcaag 12420 
agagcttgag gtctaaggag taaaaagtct agagtcgggg cggccggccg cttcgagcag 12480 
acatgataag atacattgat gagtttggac aaaccacaac tagaatgcag tgaaaaaaat 12540 
gctttatttg tgaaatttgt gatgctattg ctttatttgt aaccattata agctgcaata 12600 
aacaagttaa caacaacaat tgcattcatt ttatgtttca ggttcagggg gaggtgtggg 12660 
aggtttttta aagcaagtaa aacctctaca aatgtggtaa aatcgataag gatccgtcga 12720 
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12728 



gcggccgc 



<210> 7 

<211> 11945 

<212> DNA . 

<213> Gallus gallus 



<220> 

<221> misc_f eature 

<222> (1)..(237) 

<223> 5prime matrix 



attachment region (MAR) 



<220> 

<221> misc_f eature 

<222> (261) . . (1564) 

<223> 5prime matrix 



attachment region (MAR) 



<220> 

<221> misc_feature 

<222> (1565) . . (1912) 

<223> Sprime matrix attachment region (MAR) 



<220> 

<221> misc_feature 

<222> (1930).. (2012) . 

<223> 5prime matrix attachment region (MAR) 



<220> 

<221> misc_feature 

<222> (2013) . . (2671) 

<223> Intrinsically Curved DNA 



<220> 

<221> misc_feature 

<222> (5848) (5934) 

<223> Transcription Enhancer 



<220> 

<221> misc_feature 

<222> (9160) . . (9325) 

<223> Transcription Enhancer 



<220> 

<221> misc_f eature 

<222> (9326) .. (9626) 

<223> Negative Regulatory Element 



<220> 

<221> misc_feature 

<222> (9621) . . (9660) 

<223> Hormone Response Element 
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<220> 

<221> misc_feature 

<222> (9680) . . (10060) 

<223> Hormone Response Element 



<220> 

<221> misc_feature 

<222> (10576) . . (10821) 

<223> Chicken CR1 Repeat 



<220> 

<221> misc_feature 

<222> (10926) . . (11193) 

<223> Chicken CRl Repeat 



<220> 

<221> misc_feature 

<222> (11424) . . (11938) 

<223> Proximal promoter and lysozyme signal peptide 



tgccgccttc tttgatattc actctgttgt atttcatctc ttcttgccga tgaaaggata 
taacagtctg tataacagtc tgtgaggaaa tacttggtat ttcttctgat cagtgttttt 
ataagtaatg ttgaatattg gataaggctg tgtgtccttt gtcttgggag acaaagccca 
cagcaggtgg tggttggggt ggtggcagct cagtgacagg agaggttttt ttgcctgttt 
tttttttttt tttttttttt aagtaaggtg ttcttttttc ttagtaaatt ttctactgga 
ctgtatgttt tgacaggtca gaaacatttc ttcaaaagaa gaaccttttg gaaactgtac 
agcccttttc tttcattccc tttttgcttt ctgtgccaat gcctttggtt ctgattgcat 
tatggaaaac gttgatcgga acttgaggtt tttatttata gtgtggcttg aaagcttgga 
tagctgttgt tacacgagat accttattaa gtttaggcca gcttgatgct ttattttttc 
cctttgaagt agtgagcgtt ctctggtttt tttcctttga aactggtgag gcttagattt 
ttctaatggg attttttacc tgatgatcta gttgcatacc caaatgcttg taaatgtttt 
cctagttaac atgttgataa cttcggattt acatgttgta tatacttgtc atctgtgttt 
ctagtaaaaa tatatggcat ttatagaaat acgtaattcc tgatttcctt tttttttatc 
tctatgctct gtgtgtacag gtcaaacaga cttcactcct atttttattt atagaatttt 
atatgcagtc tgtcgttggt tcttgtgttg taaggataca gccttaaatt tcctagagcg 
atgctcagta aggcgggttg tcacatgggt tcaaatgtaa aacgggcacg tttggctgct 
gccttcccga gatccaggac actaaactgc ttctgcactg aggtataaat cgcttcagat 
cccagggaag tgcagatcca cgtgcatatt cttaaagaag aatgaatact ttctaaaata 
ttttggcata ggaagcaagc tgcatggatt tgtttgggac ttaaattatt ttggtaacgg 
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agtgcatagg 
tgatgcctgg 
aggggtgggg 
catctcacct 
acagagaagt 
gtatgataaa 
agcaccactt 
ttaaagcttc 
cttacctttg 



ttttaaacac 
atgcctgttg 
tgctttgtgt 
gctgggtact 
tcctcagttg 
gcatctctat 



tcacagcgtt tatgcagaag 1200 



agttgcagca tgctaacgag 
cagctgttta cggcactgcc 
cgtgttccca cacgctgcca 
tttcaaacca tcttagcagt agtagatgag ttactatgaa 



ttgcagtgag cattgcagat 
cacagccacc tcccggaaca 



gatattctca tgggatgtct 
ttgtaaatta tgcacttgtt 



tttttcccat gttgggcaaa 
agttcctgaa tcctttctat 
tgtgtctgtg cttcaatctt 
cttgaagata gtaaacagta 
aaagaattcc gcctattcat 
accatgtaat gtaattttac acccccagtg ctgacacttt ggaatatatt caagtaatag 



attgcagcag gtgtaggctc tggtgtggcc 
tttggaaata cactgacttg attgaagtct 
atcccaatga aatcgagcat ttcagttgta 



gtgcgtgtgc ttttataaat 



actttggcct caccctcttg tgtactgtat 
tatgattatt acattatgaa agagacattc 

acaagtgatt 
aaaaaaaaag taatataaaa aggaccaggt 
taaacagtta catttttatg aagattacca 
attgtcttcc tgtaccattg catttcctca 



tttgtaatag aaaatatttt aaactgtgca 
tgctgatctt caaatgtaag aaaatgagga 
gcaaattagt gcaggtgtcc ttaaaaaaaa 
gttttacaag tgaaatacat tcctatttgg 
gcgctgctga ctttctaaac ataaggctgt 
ttcccaattt gcacaaggat gtctgggtaa 



actattcaag aaatggcttt gaaatacagc 



ttgcactgca aaatgtcagg 
atatgtgtat atagtaagca 
gtgttgccgg agacctgtat 
ttaatacatt ttcagcagaa 
ttttagatgt tatacttgaa 
agcctttagg agactgttaa 
atagtagttt accttgtatt 
tctgtcttca ttttgacttg 
atattcagac atcaaaactt 
gttattattt ctgaaactag 
agagtgtaag gctagtgaga 
tatcagattt ttttttcatt 
tgaaatgcag tctgattggc 
tttggaatga aggaagttaa 



atgggagctt gtctgagttg gaatgcagag 
aaatggatgt ctctcagaat 

tccagcaggc 



gtttcctgat 



gcccaactcc aaaggatttt 
caaagagtct gctgaatgtt 



ttctcaacaa ggtaagatgg tatcctagca actgcggatt 



gtacttagtt 
atactgcata 
gcaatttgct 



aatctctacc tttagggatc gtttcatcat 
acttttagct ttcatgggtt cctttttttc 
gtccaacttt tgtgttggtc ttaaactgca 
agaccatttt tatattaaaa aatacttttg 



gaagaaataa 

tctgatatcc ttgcagtgcc cattatgtca gttctgtcag 
aacgtgagct 



cagtggagtt acagctgcgg ttttgatgct 
gtcttcatct gctcatcaaa cacttcatgc 
tttattgata cttttttaaa gtcaactttt 
ttgttttcta gactgcatag cttctgaatc 
acagcactct tcatcttact taaacttcat 



aaatgatgtt 
aatgcataca 
tggaaatata 
atgaagaagc 

gcaagggcac aggtccatga aatagagaca gtgcgctcag 



* »■» /■ r\ 
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gagaaagtga acctggattt ctttggctag 
cccgattcct tgaaagggct ccagctttaa 
ggccactggt tatttactgc attatgtctc 
ttgagcatgg actatagcct ggcttcagag 
gtgctgggct gtggctgggg ggactgtggg 
cagggaaaag tgtgggtaac tatttttaag 
acgtagggtg tgtactctcg aagattaaca 
acagtggaag cattcaaggg tagatcatct 
aagcggtatc agaagagcga ggaaggtaag 
gcagtctggg aaagtagcac cccttgagca gagacaagga 
taggagaact ttcttgctga attctacttg caagagcttt 

tggtgagctg 
ttcttccccc 



tgttctaaat 

tgcttccaaa 

agtttcgcag 

gccaggtgaa 

gactccaagc 

tactgtgttg 

gtgtgggttc 

aacgacacca 

cagtcttcat 



ctgtagtgag gaaagtaaca 3060 
ttgaaggtgg caggcaactt 



ctaacctggc ttctccacta 
ggttgggatg ggtggaagga 
tgagcttggg gtgggcagca 
caaacgtctc atctgcaaat 
agtaatatat ggatgaattc 
gatcatcaag ctatgattgg 
atgttttccc tccacgtaaa 
aataattcag gagcatgtgc 
gatgcctggc ttctggtgcc 
gagggaaaga ttctgctcaa 
agcactgtgc agcagagtgg 



ttctgcagca cctgcaaggc ccagagcctg 
gtccaagctt cagcaggtca ttgtctttgc 
aactgatgtc gaagcctcct gtccactacc tgttgctgca ggcagactgc tctcagaaaa 
agagagctaa ctctatgcca tagtctgaag gtaaaatggg ttttaaaaaa gaaaacacaa 



gcctagggct ctgcccgcga 
ttgagattta gacacaaggg 
agcctgtact tcaaatatat 



aggcaaaacc ggctgcccca tgagaagaaa 
aagcccccag gcagtgtgac aggcccctcc 

agtgcgtgtt 
aagcctgaaa 
attttgtgag 

aagttgcaag agattgaagg ctgagtagtt 
gaaactactg cttctaaaca cttgtttgag 
gttacatgtc tgatgcactt gcttgtcctt 
cgcatttgtc acttatccca tatctgtcat 
tcagaagaaa cagatgtgat aatccccagc 
tctttccctt tttcctgcta agtaaggatt 
tcttcctgcc ttacattctg ggcattattt 
tttgtgtctt cctactctta gagtgaatgc 
gttggccgca gttctctgat gaacacacct 
tctgaggaac gggcagcgtt tgcctctgaa 
ttgcaactga tggtggaact ggtgcttaaa 



gcagtggtaa 
tgccacctag 
tctttggtgg 
ggaggtgttg 
ggagtgtagc 

* 

gagagggtaa 
tggtgagacc 
ttccatccac 



acatggtaga 
aggcgggaac 
gttttgtttg 
ggcactattt 
gaattggcca 
cacgtttaat 
ttggataggt 
atccatgcat 



aaaggtgcag 

aagcttccct 

gcgtttggtt 

tggtttgtaa 

atttaaaata 

gagatcttct 

gagtgctctt 

tccacatcca 



atctgacata cctgtctctt cgtcacttgg 
cgccccaagt ttgagaagat ggcagttgct 
ttctcctggc tttgacacct cacgaaatag 
caaatatctt tggagtgcgc tgctctcaag 
tcttagagtg aaagagaagg aagagaagat 
ctgaataatg gccaaaggtg ggtgggtttc 
agcaaggagc tctgcggagt tgcagttatt 
gcagattccc taggttccct gctacttctt 
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ttccttcttg gcagtcagtt tatttctgac 
gaaagtatgt ggctctgcct gggtgtgtta 
cgggcaccat tcatcccaaa caggatcctc 
ctccaacctc aaaacattaa ttggagtacg 
taagtcattt agtctggact ctgcagcatg 
cactgatgga ggagtagtaa aaatggagac 
aagaaactga tggaaataat gcatgaattg 
ctacttcaaa tgaggtcgga gaaggtcagt 
cgagtaccat ttttctctac aagaaaaacg 
catagcggct gaagctcccc cctggctgcc 
ccttggggtt tctctcacag cagtaatggg 
tgtcatgtgg gatccctact gtgccctcct 
cagcggtttg gaaagagaaa aagaatttgg 
ccagcatttt ggtttttaat tatgtcaata 
tgggtgtatt accgaggaac aaaggaaggc 



agacaaacag 

cagctctgcc 

attcatggat 

aatgtaatta 

taggtcggca 

cgattcagaa 

tgtggtggac 

gttttattag 

attctgagct 

tgccatctca 

acaatacttc 



ccacccccac tgcaggctta 4920 



ctggtgaaag gggattaaaa 
caagctgtaa ggaacttggg 
aaactgcatt ctcgcattcc 
gctcccactt tctcaaagac 
caaccaacgg agtgttgccg 
atttttttta aatacataaa 
cagccataaa accaggtgag 
ctgcgtaagt ataagttctc 
gctggagtgc agtgccattt 
acaaaaattc tttcttttcc 



ggttttacgt taccccctga ctgttccatt 
aaataaaaca tgtctacgtt atcacctcct 
actggcttag atttggaaat gagagggggt 
ttatataaac tcaagtcttt tatttagaga 



actggcaagc tgtcaaaaac aaaaaggcct taccaccaaa ttaagtgaat agccgctata 
gccagcaggg ccagcacgag ggatggtgca ctgctggcac tatgccacgg cctgcttgtg 



actctgagag caactgcttt ggaaatgaca 
tgcgtagagc gtgtgcttgg cgacagtttt 
tcctcattct cctaagcatg tctccatgct 
atgaatccat cactgtagga ttctcgtggt 
atggaagctt atttattttt cgttcttcca 
accacagcaa attaaaggtg aaggaggctg 
ttcttccttg caaggccaca ggaaaatgct 
agttcagtct cctgctggga cagctaaccg 
aggaccaaat agggtctatc tggggttttt 

♦ * • 

cactatttca ctgctcccac ggttacaaac 
acattacata aatttgacct ggtaccaata 
ctgtgtttaa ccccttaagg cattcagaac 
aggggcctta aacatcatcc atttccaacc 
ctcaggctgc ccagggcccc atccagcctg 



gcacttggtg caatttcctt tgtttcagaa 
tctagttagg ccacttcttt tttccttctc 
ggtaatccca gtcaagtgaa cgttcaaaca 
gatcaaatct ttgtgtgagg tctataaaat 
tatcagtctt ctctatgaca attcacatcc 
gtgggatgaa gagggtcttc tagctttacg 
gagagctgta gaatacagcc tggggtaaga 
catcttataa ccccttctga gactcatctt 
gttcctgctg ttcctcctgg aaggctatct 
caaagataca gcctgaattt tttctaggcc 
ttgttctcta tatagttatt tccttcccca 
aactagaatc atagaatggt ttggattgga 
ctctgccatg ggctgcttgc cacccactgg 
gccttgagca cctccaggga tggggcaccc 



* n n r\ 
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acagcttctc 
ttaacatcta 



tgggcagcct gtgccaacac 
atctaaatct cttctctttt 



ctatctgtcc aagaaatgtg tattggtctc 
ggctgcagtg aggtctcccc acagccttct 
cagcctgtct tcgtaggaga tcatcttagt 
cacggctttc ttgtggagcc ccaggtctgg 
gcagagcaga tggggacaat cgcttacccc 
cccagggtac tgttggcctt tcaggctccc 
catccaccag aacccacgct tcctggttaa 
tcaggagact tccattcttt aggacagact 
atatacattt cagttcatgt ttcctgtaac 
tacatgcaga attcctagtg ccatctcagt 
caatttgctg caagtacctt ccaagctgcg 



gggtaagctt ttgtatctgc 
ttctgactgc accattttct 
tcctttccca gcttgtatct 
cttcagcagc catttaattc ttcagtgtca 
ttttcagcag tcttgcaaag aacatctagc 



ttaccttttg 
ctctgctctg 
ttgtcctcca 



cagttcttct 
gcattttatt 



tgtttgaggt gagccataaa 
acttctatta tgtacttact 
cacagtgtct ctgtgtcctt 



ctgggatttc 

aaccttggca atctgcccag ctgcccatca 
tcttcagcca ataaacaaaa tgtgagaagc 
tcaagggaga gacagctgaa gggttgtgta 



tgtcagacag ttttgcctga tttatacagg 
aggccacctt gcagtccttg gtttgtaaga 
cgtggagaat catgatggca gttcttgctg 
cagcaaagta acacttgctg ctgtaggtgc 
caccaagatg agggatgctc ccagctgacg 
ctgcctgctc attagcatca cctcagccct 



ctcaccactc 
agtttaaagc 
cctcctgctt 
cttctccagg 
ggccctcctc 
atgcagtact 
tccctgctgg 
agaccccttg 
tacttctgcc 
gtgttacacc 
aggacagaat 
agggttttca 
gcctcccata 
agagaccctg 
agatcaccca 
ttgacaaata 
tcttgttctg 
tgaaaacttt 
ttactagaac 
ttgacataac 
cacatggttt 
caagaaaaga 
ccaaacaaga 
gctcaataga 
cacgccccaa 
taagtcatag 
tttactatgg 



tctgggtaaa gaattctctt 
cattcctctt tttcccgttg 



ataagcagga agtactggaa 
ctgaacaagc ccagctcctt 
tggacccatt ccaacagttc 
tcagatgggg ccttacaaag 
ctgcccctgt tttgatgcag 
ctgatttgtg tcaagctttt 
ctcacttctg taagcttgtt 
tacctgccct attcttgcat 
atgtattcct ctaacaaaaa 
tggcagtatt agcacatagt 
aatcctgtat ttgggatcag 
ggggttctga tgtgcttcag 
gttgttcctg tacaacttcc 
caggcctatt tttgtgtttg 



ttgatgccac 

ctgccattca 

ttcgtcactg 

acagacacgc 

tactgtcata 

gattcctttt 

acttgtgggg 

attaagaaat 



tggaacagga 
atattcttac 
acaagtttat 
acatattttg 
cttccgttat 
ttattacttc 
caggctgcca 
aataaagctg 



gccagagagg ctgtctgcca 
gtaacttttc tggtgaattg 
taagatgcta aaataggaga 



tctgctatcc agacagcgat ggcactcgca 
gatgctgggg cagtaacagt gggtcccatg 
caccagccca tcagaaggat catcccaagc 



tgaggaaagt tgctcatctt cttcacatca tcaaaccttt ggcctgactg atgcctcccg 
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gatgcttaaa tgtggtcact gacatcttta tttttctatg atttcaagtc 
gatcaggagg gaacacatag tgggaatgta ccctcagctc caaggccaga 
atgatcatgc atgctactta ggaaggtgtg tgtgtgtgaa tgtagaattg 
tttttcttcc tgctgtcagg aacattttga ataccagaga aaaagaaaag 
ggcatgggag gagttgtcac acttgcaaaa taaaggatgc agtcccaaat 
tcagggtctg aaggaggatc agaaactgtg tatacaattt caggcttctc 
ttttgaaagc tgttcctggc cgaggcagta ctagtcagaa ccctcggaaa 
tgtcttcaag gtgcagcagg aggaaacacc ttgcccatca tgaaagtgaa 



agaacctccg 8640 



cgctgaagga 

gttcattttt 

tctccagttg 

ttctgtttga 

gaattgcaaa 

tctggctatg 

agtttgaaga 

atagctatgg 

gtatgaactc 

gcagtgattt 

tcagggaaaa 

atcacagctc 

agggcctcaa 

aggcaatcct 

tggcactttt 



atccagctcc 
ataggacttc 
gcagatgact 
ccaccatgga 
gcaggagtta 



tgtttgagca ggtgctgcac 
caggaaggat cttcttctta 
atgactactg acaggagaat 
gtcacccatt tctttactgg 
gcgaagatct tcatttcttc 



aaagtctgct tacaaggaag aggataaaaa 



caatgaggtt ttagctgcat 



tatttacgtg 
aggtctctcg 
agggtttatg 
aaaaaaaaag 
agtgcggtcc 
gataactgat 



tctttttgct 
ggctactggc 
agtacttttg 
ccaaccctga 
cagagaacac 
gttagtcaga 
ctccgctgca 



ttgacatgaa 
tagttactta 
atggattgat 
cagtaaatca 
cagacatccc 
agggactctt 
agactttcca 
cagttccagt 



actcccacac 
agcttcttaa 
gaggaactag 
tatttggaaa 
catgttggtg 
tcatagggat 
gaaattgaga 
ttgaccccag 
tacatacaac 



tcttccttca 
cctttgttat 
tgctcttctt 
gttcataatc 
tgaatgcagc 
caggaacaaa 
taaccactgc 
tgaaacaaca 
ttatggtaca 
ctgggaatat 
taataattct 
acagcacagt 
aataaatcta 



ggaattttct 
tgggtcaggc cgtgatccaa ggagcagaag 



tagggttagt 
agctcaggtg 
ctcttaggac 
ttctggccac 
catcccagtt 
ttccagctat 



tgcctgaccg tcccaactca 
ttgaaattgc agtgtggccc 
ttaatcctca gcaagtgcaa 

* 

atcagtatca acaagtggtt 
tactctctaa tggagttgca 
ggctgctaaa cattagggtc 



ctgcactcaa 
agaggggctg 
tttgcagcca 
tggcttggaa 
ttttgaagca 
aattttccag 



acaaaggcga 
caccagtact 
ttaaattgaa 
gatggagtct 
ggacactgtg 
tgcactttct 



aaccacaaga 
ggattgacca 
ctaactgata 
aggggctcta 
aaaagctggc 
gaagtgtctg 



cctctactgg 

ctgaggtcaa 

tgtaatttta 

aatgttaatc 

gaaatcaagg 

ctttatgtac 

agttcagctg 

tgtacagttc 

ggtcagggag 

gtggcttttg 

cgaggcaaca 

ctacaatgca 

caggagtagc 

ctcctaaaga 

cagttcccca 



tgcaaagctg cccaaacata gcacttccaa ttgaatacaa ttatatgcag gcgtactgct 
tcttgccagc actgtccttc tcaaatgaac tcaacaaaca atttcaaagt ctagtagaaa 



rt -i n a 
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9240 
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9480 
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9660 
9720 
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9900 
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10200 
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10380 
10440 
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gtaacaagct ttgaatgtca ttaaaaagta tatctgcttt cagtagttca gcttatttat 10500 
gcccactaga aacatcttgt acaagctgaa cactggggct ccagattagt ggtaaaacct 10560 
actttataca atcatagaat catagaatgg cctgggttgg aagggacccc aaggatcatg 10620 
aagatccaac acccccgcca caggcagggc caccaacctc cagatctggt actagaccag 10680 
gcagcccagg gctccatcca acctggccat gaacacctcc agggatggag catccacaac 10740 
ctctctgggc agcctgtgcc agcacctcac caccctctct gtgaagaact tttccctgao 10800 
atccaatota agccttocct ccttgaggtt agatccactc ccccttgtgc tatcactgto 10860 
tactcttgta aaaagttgat tctcctcctt tttggaaggt tgcaatgagg tctccttgca 10920 
gccttcttct cttctgcagg atgaacaagc ccagctccct cagcctgtct ttataggaga 10980 
ggtgctccag ccctctgatc atctttgtgg ccctcctctg gacccgctcc aagagctcca 11040 
catctttcct gtactggggg ccccaggcct gaatgcagta ctccagatgg ggcctcaaaa 11100 
gagcagagta aagagggaca atcaccttcc tcaccctgct ggccagccct cttctgatgg 11160 
agccctggat acaactggct ttctgagctg caacttctcc ttatcagttc cactattaaa 11220 
acaggaacaa tacaacaggt gctgatggcc agtgcagagt ttttcacact tcttcatttc 11280 
ggtagatctt agatgaggaa cgttgaagtt gtgcttctgc gtgtgottot tcctcctcaa 11340 
atactcctgc ctgatacctc accccacctg ccactgaatg gctccatggc cccctgcagc 11400 
cagggccctg atgaacccgg cactgcttca gatgctgttt aatagcacag tatgaccaag 114 60 
ttgcacctat gaatacacaa acaatgtgtt gcatccttca gcacttgaga agaagagcca 11520 
aatttgcatt gtcaggaaat ggtttagtaa ttctgccaat taaaacttgt ttatctacca 11580 
tggctgtttt tatggctgtt agtagtggta cactgatgat gaacaatggc tatgcagtaa 11640 
aatcaagact gtagatattg caacagacta taaaattcct ctgtggctta gccaatgtgg 11700 
tacttcccac attgtataag aaatttggca agtttagagc aatgtttgaa gtgttgggaa 11760 
atttctgtat actcaagagg gcgtttttga caactgtaga acagaggaat caaaaggggg 11820 
tgggaggaag ttaaaagaag aggcaggtgc aagagagctt gcagtcccgc tgtgtgtacg 11880 
acactggcaa catgaggtct ttgctaatct tggtgctttg cttcctgccc ctggctgcct 11940 
taggg 



11945 



<210> 8 

<211> 285 

<212> DNA 

<213> SV40 

<220> 

<221> misc_feature 

<222> (1)..(285) 

<223> SV40 Polyadenylation Sequence 
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aaagtctaga gtcggggcgg ccggccgctt cgagcagaca tgataagata cattgatgag 
tttggacaaa ccacaactag aatgcagtga aaaaaatgct ttatttgtga aatttgtgat 
gctattgctt tatttgtaac cattataagc tgcaataaac aagttaacaa caacaattgc 
attcatttta tgtttcaggt tcagggggag gtgtgggagg ttttttaaag caagtaaaac 
ctctacaaat gtggtaaaat cgataaggat ccgtcgagcg gccgc 



<* A 



<210> 9 

<211> 5972 

<212> DNA 

<213> Gallus gallus 

<220> 

<221> misc__f eature 

<222> (1)..(5972) 

<223> Lysozyme 3prime domain 



cgcgtggLg gtggcggggg gttcccagga gagcccccag cgcggacggc agcgccgtca 
ctcaccgctc cgtctccctc cgcccagggt cgcctggcgc aaccgctgca agggcaccga 
cgtccaggcg tggatcagag gctgccggct gtgaggagct gccgcgcccg gcccgcccgc 
tgcacagccg gccgctttgc gagcgcgacg ctacccgctt ggcagtttta aacgcatccc 
tcattaaaac gactatacgc aaacgccttc ccgtcggtcc gcgtctcttt ccgccgccag 
ggcgacactc gcggggaggg cgggaagggg gccgggcggg agcccgcggc caaccgtcgc 
cccgtgacgg caccgccccg cccccgtgac gcggtgcggg cgccggggcc gtggggctga 
gcgctgcggc ggggccgggc cgggccgggg cgggagctga gcgcggcgcg gctgcgggcg 
gcgccccctc cggtgcaata tgttcaagag aatggctgag ttcgggcctg actccggggg 
cagggtgaag gtgcggcgcg ggcggaggga cggggcgggc gcggggccgc ccggcgggtg 
ccggggcctc tgccggcccg cccggctcgg gctgctgcgg cgcttacggg cgcgcttctc 
gccgctgccg cttctcttct ctcccgcgca agggcgtcac catcgtgaag ccggtagtgt 
acgggaacgt ggcgcggtac ttcgggaaga agagggagga ggacgggcac acgcatcagt 
ggacggttta cgtgaagccc tacaggaacg aggtagggcc cgagcgcgtc ggccgccgtt 
ctcggagcgc cggagccgtc agcgccgcgc ctgggtgcgc tgtgggacac agcgagcttc 
tctcgtagga catgtccgcc tacgtgaaaa aaatccagtt caagctgcac gagagctacg 
ggaatcctct ccgaggtggg tgttgcgtcg gggggtttgc tccgctcggt cccgctgagg 
ctcgtcgccc tcatctttct ttcgtgccgc agtcgttacc aaaccgccgt acgagatcac 
cgaaacgggc tggggcgaat ttgaaatcat catcaagata tttttcattg atccaaacga 



120 
180 
240 
285 



60 
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240 
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gcgacccgta 
gggctgctcg 
cccgggacgt 
ggcttctctg 
aggccgtgga 
tcctctcttt 
aagcacttgg 
acgaatactg 
aaagccatta 
ggagataact 
gtaaccctgt 
aaaactgtag 
ctgactttat 
ttcaagatcc 
gtgcttacaa 
ttcactacta 
gaatagcttg 
aagtagaata 
gcaaagatta 
tggaggaccg 
ttcagatttt 
gtttttgttt 
gatggactga 
ccacttatca 
tttttcagga 
aaagataatt 
accagggaaa 
gaaaggttaa 
gaagaggatg 
gagaatgaaa 
ctgtatgaat 



agtacgctca 
ctgctgccgg 
gtgctctgtg 
gcgcttagcg 
aaccgctctc 
ggaaatgctt 
ccagacaatt 
aatttaagta 
aaatcctgtg 
ggtgtttccc 
atcacttgct 
tttctgaatt 
ttaaaaaaaa 



gcttctcgta 

tcacagtccc 

ttcggtcagc 

caggagcacg 

aaactgattt 

tagagaaggt 

taaagacatc 

actctgctca 

cagttgttta 

tacagttctc 



gtgcttcccc cgtcctggcg gcccggggct 
gccagccgcg gagctgactg agctcccttt 
gaggctatcg ggagggcttt ggctgcattt 
ttgtgctacg cctgaactac agctgtgaga 
attggcgaaa tggctctaaa ctaaatcgtc 
ctctgtggta gttcttatgc atctatccta 
aagcagcatt tatagcaggc acgtttaata 
cgttgtatga cgtttatttt cgtattcctg 
gtaagaacag ctgccactgt tttgtatcta 
aagctgataa aactctgtct ttgtatctag 



1200 



-i i en 



gaagcttttt cagtctgaca 
ctatga.tgaa atggtatgaa 
ttattgatgg tgctgtgtat 



tactgecatg 
gcatgaaaca 
atgttttctg 
gtattggcag 
aatgeagect 
tacttcctca 
tgaccgaggg 
cttttcacct 



ttcactctgg 
atgaggtcat 
tttttactgc 
cgtgtgcttt 
catgtttctt 
agctggaagc 
aagcaagtcg 
atcagtctaa 
ggacttcagt 
ttcatttgea 



atgeagcaac 
gagtgtaagt 
tgggatgtga 
tgatgacgtg 
gctttatttg 
ctgaattgea 
acattggatc 
agtctttaat 
caaatgatgt 
caaacaaact 



tgetaacgae 
gcaaaatgag 
tegtacagtg 
ategatgect 
agatagtttg 
ctgtccaaag 
gctatctccc 
tcccagggtt 
tgtgaattac 
tttcttcttc 



tgttgttgag 
ccaaattact 
cttctttttt 
tgccaaaaag 
tgaaaccatc 
agatatgtga 
cgtggagttg 
aatcactgea 



tctgtaaggc 
gecacatata 
gccaccacag 
aaaaccagtt 
aactgettaa 
tgagtgttga 
tatgegttet 
gtgtgtgaca 



ecaatgeaat 
aattttaatg 
tttggtcctt 
gtcccgtcag 
gataccttcg 
agtttggctg 
tgcttatcat 
gttcatttta 
gtgtgaaatg 
atttcttttg 
ttgttttttc 
actgettcag 
egtatttect 
taaaagtaac 



taaagaaagg 
ttgeagatet 
ttgaaattgc 
agagtgaaat 
cttggcaggg 
ctccaattct 
actgactttt 



cctgggaaag 
tcaaccgagc 
ccttagatat 
ctgacacttg 
ccgaccgtca 
tgtgaaattt 
gtttgaaatg 
tggaatgcaa 
tgtggggatc 
ctgttaccag 
cttggtcata 
ccacaaaact 
tttttttccc 
tgttttgtgc 
ttggaatttt 
tgaagtaaaa 
tgagcttaaa 
cagaaaactc 
agectataat 
gtaaeggaga 
tataaatggc 
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agaaaacaag aatgaatgta tcctcatttt atagttaaaa tctatgggta tgtactggtt 
tatttcaagg agaatggatc 
atttgagtgg tgtaggaaca 
ttcactgttt tgttttgtat 
aaataaaaca tgcaggtttt 
gtattattgt taacagcaag 
gatgcttaag tctgttgtca 
catttctaat aacttcttta 
tatgtagttg atgcctttat 



gtagagactt ggaggccaga ttgctgcttg. tattgactgc 
ttttgtctat ggtcccgtgt tagtttacag aatgccactg 
tttacttttt ctactgcaac gtcaaggttt taaaagttga 
ttttaaatat ttttttgtct ctatccagtt tgggcttcaa 
tcctgattta agtcagaggc tgaagtgtaa tggtattcaa 
gcaaaacaaa agagaaaact tcataaaatc aggaagttgg 
tcaacagata agagtttcta gccctgcatc tactttcact 
attttgtgtg tttggatgca ggaagtgatt cctactctgt 
tatgtagata ttctatttaa cacttgtact ctgctgtgct tagcctttcc ccatgaaaat 
tcagcggctg taaatccccc tcttcttttg tagcctcata cagatggcag accctcaggc 
ttataaaggc ttgggcatct tctttactgc tttgagattc tgtgttgcag taacctctgc 
cagagaggag aaaagcccca caaacctcat ccccttcttc tatagcaatc agtattacta 
atgctttgag aacagagcac tggtttgaaa cgtttgataa ttagcattta acatggcttg 
gtaaagatgc agaactgaaa cagctgtgac agtatgaact cagtatggag acttcattaa 
gacaaacagc tgttaaaatc aggcatgttt cattgaggag gacggggcaa cttgcaccag 
tggtgcccac acaaatcctt cctggcgctg cagaccaatt tttctggcat tctgactgcc 



gcaactattt 

agtataaatg 

acaaattagc 

agccctcttc 

gctatcgaag 

aatttgcttt 

ctctgtgcaa 

tgagagggct 

gccggtttct 

ccaaggaggc 

tgggcagcct 

atcactgtgg 



gttgctgctg gtcacagaga 
ttttccaagt gttgtaggta 
attaaaaaca taaatgaaag 
ccagcctgct ttttggtaga 
aaactggcaa cggagctatt 
aaggtttggt ttattcatta 
ttccatgcac caggtggcag 
tgccgggatt ctggcacgtc 
tactgctttc tgtagacatg 
aacacactgg aacaggttgc 
ggccaggctg gatgtggctc 
agcggggttg aaactcgatg 
tatgattcaa cagcaaatca tatgtactga gagaggaaac aaacacaagt gctactgttt 
gcaagttttg ttcatttggt aaaagagtca ggttttaaaa ttcaaaatct gtctggtttt 



caggcaattt gcttgtagta 
agagcacttt gagtatactt 
tgcacagaac atttttagtt 
actgactttg acaagtagag 
aacaaagtta atcgtctttt 
ctgaaaaaga acgtgctgtc 
ggtctcacca gccttttaat 
tcgtttgttt gaacagcgtg 
ttatgaaact gttcacactg 



ttatcagcca 
catcggctcc 
tttgcttggg 
tgaggctaga 
gatccttgtt 
taagctgtag 
agtgctctct 
cagactggct 
ctcctgcagc 
cgtggatgcc ccatccctgg aggcattcaa 
ggtctggtgg ttggcgatcc tgcacatagc 
tccttttcaa cccaggctat tctatgattc 
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ggtgtttttt tttttttatt tattatttct ttggggttct ttttgatgct ttatctttct 
ctgccaggac tgtgtgacaa tgggaacgaa aaagaacatg ccaggcactg tcctggattg 
cacacgctgg ttgcactcag tagcaggctc agaactgcca gtctttccac agtattactt 
tctaaaccta attttaatag cgttagtaga cttccatcac tgggcagtgc ttagtgaatg 
ctctgtgtga acgttttact tataagcatg ttggaagttt tgatgttcct ggatgcagta 
gggaaggaca gattagctat gtgaaaagta gattctgagt atcggggtta caaaaagtat 
agaaacgatg agaaattctt gttgtaacta attggaattt ctttaagcgt tcacttatgc 
tacattcata gtatttccat ttaaaagtag gaaaaggtaa aacgtgaaat cgtgtgattt 
tcggatggaa caccgccttc ctatgcacct gaccaacttc cagaggaaaa gcctattgaa 
agccgagatt aagccaccaa aagaactcat ttgcattgga atatgtagta tttgccctct 
tcctcccggg taattactat actttatagg gtgcttatat gttaaatgag tggctggcac 
tttttattct cacagctgtg gggaattctg tcctctagga cagaaacaat tttaatctgt 
tccactggtg actgctttgt cagcacttcc acctgaagag atcaatacac tcttcaatgt 
ctagttctgc aacacttggc aaacctcaca tcttatttca tactctcttc atgcctatgc 
ttattaaagc aataatctgg gtaatttttg ttttaatcac tgtcctgacc ccagtgatga 
ccgtgtccca cctaaagctc aattcaggtc ctgaatctct tcaactctct atagctaaca 
tgaagaatct tcaaaagtta ggtctgaggg acttaaggct aactgtagat gttgttgcct 
ggtttctgtg ctgaaggccg tgtagtagtt agagcattca acctctagaa gaagcttggc 
cagctggtcg acctgcagat ccggccctcg ag 



4920 
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<210> 10 

<211> 18391 

<212> DNA 

<213> Gallus gallus 

<220> 

<221> misc_feature 

<222> (1)..(237) 

<223> Sprime matrix (scaffold) attachment region- (MAR) 



<220> 
<221> 
<222> 
<223> 



misc_feature 

(261) (1564) . t nns 

5prime matrix (scaffold) attachment region (MAR) 



<220> 
<221> 
<222> 
<223> 



misc_feature 
(1565) (1912) 

5prime matrix (scaffold) attachment region (MAR) 
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<220> 
<221> 
<222> 
<223> 



misc_f eature 
(1930) . . (2012) 



Sprime matrix (scaffold) attachment region 



(MAR) 



<220> 
<221> 
<222> 
<223> 



misc_f eature 
(2013) . . (2671) 
Intrinsically curved DNA 



<220> 
<221> 
<222> 
<223> 



misc_f eature 
(5848) . . (5934) 
Transcription enhancer 



<220> 
<221> 
<222> 
<223> 



mi sc__f eature 
(9160) . . (9325) 
Transcription enhancer 



<220> 
<221> 
<222> 
<223> 



mi sc_f eature 
(9326) .. (9626) 

Negative regulatory element 



<220> 
<221> 
<222> 
<223> 



mis cofeature 
(9621) (9660) 
Hormone response element 



<220> 
<221> 
<222> 
<223> 



mis c_f eature 
(9680) . . (10060) 
Hormone response 



element 



<220> 
<221> 
<222> 
<223> 



misc_feature 
(10576) . . (10821) 
Chicken CR1 Repeat Sequence 



<220> 
<221> 
<222> 
<223> 



misc_feature 
(10926) (11193) 
Chicken CR1 Repeat Sequence. 



<220> 
<221> 
<222> 
<223> 



mis c_f eature 
(11424) . . (11938) 

Lysozyme Proximal Promoter and Lysozyme Signal Peptide 



<220> 

<221> misc feature 
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<222> (11946) .(12443) ro don-optim* zed for expression in chick 

<223> numan interferon alpha ID coaon opllu— ±. r 



ens 



<220> 
<221> 
<222> 
<223> 



misc_f eature 
(12464) . . (18391) 
Chicken Lysozyme 3prime domain 



<400> 10 

tgccgccttc tttgatattc actctgttgt 
taacagtctg tataacagtc tgtgaggaaa 

ttgaatattg gataaggctg 
tggttggggt ggtggcagct 
tttttttttt aagtaaggtg 
tgacaggtca gaaacatttc 
tttcattccc tttttgcttt 



atttcatctc ttcttgccga tgaaaggata 
tacttggtat ttcttctgat cagtgttttt 



ataagtaatg 
cagcaggtgg 
tttttttttt 
ctgtatgttt 
agcccttttc 
tatggaaaac 
tagctgttgt 
cctttgaagt 
ttctaatggg 
cctagttaac 
ctagtaaaaa 
tctatgctct 
atatgcagtc 
atgctcagta 
gccttcccga 
cccagggaag 
ttttggcata 
agtgcatagg 
tgatgcctgg 
aggggtgggg 
catctcacct 
acagagaagt 
gtatgataaa 



gttgatcgga 
tacacgagat 
agtgagcgtt 
attttttacc 
atgttgataa 
tatatggcat 



acttgaggtt 
accttattaa 
ctctggtttt 
tgatgatcta 
cttcggattt 
ttatagaaat 



tgtgtccttt 
cagtgacagg 
ttcttttttc 
ttcaaaagaa 
ctgtgccaat 
tttatttata 



acaaagccca 



gtcttgggag 
agaggttttt 
ttagtaaatt 
gaaccttttg 
gcctttggtt 
gtgtggcttg aaagcttgga 



ttgcctgttt 
ttctactgga 
gaaactgtac 
ctgattgcat 



gtttaggcca gcttgatgct 
tttcctttga aactggtgag 



ttattttttc 
gcttagattt 
taaatgtttt 



gtgtgtacag 
tgtcgttggt 
aggcgggttg 
gatccaggac 
tgcagatcca 
ggaagcaagc 
ttttaaacac 

» 

atgcctgttg 
tgctttgtgt 
gctgggtact 
tcctcagttg 
gcatctctat 



gtcaaacaga 
tcttgtgttg 
tcacatgggt 
actaaactgc 
cgtgcatatt 
tgcatggatt 
agttgcagca 
cagctgttta 
cgtgttccca 
tttcaaacca 
gatattctca 
ttgtaaatta 



gttgcatacc caaatgcttg 
acatgttgta tatacttgtc 
acgtaattcc tgatttcctt 
cttcactcct atttttattt atagaatttt 



atctgtgttt 
tttttttatc 



taaggataca gccttaaatt 
tcaaatgtaa aacgggcacg 
ttctgcactg aggtataaat 
cttaaagaag aatgaatact 



tcctagagcg 
tttggctgct 
cgcttcagat 
ttctaaaata 



tgtttgggac ttaaattatt ttggtaacgg 
tgctaacgag tcacagcgtt tatgcagaag 
cggcactgcc ttgcagtgag cattgcagat 
cacgctgcca cacagccacc tcccggaaca 
tcttagcagt agtagatgag ttactatgaa 
tgggatgtct tttttcccat gttgggcaaa 
tgcacttgtt agttcctgaa tcctttctat 
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agcaccactt attgcagcag gtgtaggctc tggtgtggcc 
ttaaagcttc tttggaaata cactgacttg attgaagtct 
cttacctttg atcccaatga aatcgagcat ttcagttgta 
accatgtaat gtaattttac acccccagtg ctgacacttt 
actttggcct caccctcttg tgtactgtat tttgtaatag 
tatgattatt acattatgaa agagacattc tgctgatctt 
gtgcgtgtgc ttttataaat acaagtgatt gcaaattagt 
aaaaaaaaag taatataaaa aggaccaggt gttttacaag 
taaacagtta catttttatg aagattacca gcgctgctga 
attgtcttcc tgtaccattg catttcctca ttcccaattt 
actattcaag aaatggcttt gaaatacagc atgggagctt 
ttgcactgca aaatgtcagg aaatggatgt ctctcagaat 
atatgtgtat atagtaagca gtttcctgat tccagcaggc 



tgtgtctgtg cttcaatctt 
cttgaagata gtaaacagta 
aaagaattcc gcctattcat 
ggaatatatt caagtaatag 
aaaatatttt aaactgtgca 
caaatgtaag aaaatgagga 
gcaggtgtcc ttaaaaaaaa 
tgaaatacat tcctatttgg 
ctttctaaac ataaggctgt 
gcacaaggat gtctgggtaa 
gtctgagttg gaatgcagag 
gcccaactcc aaaggatttt 
caaagagtct gctgaatgtt 



gtgttgccgg agacctgtat ttctcaacaa ggtaagatgg tatcctagca actgcggatt 

tttagggatc gtttcatcat 
ttcatgggtt cctttttttc 
tgtgttggtc ttaaactgca 
tatattaaaa aatacttttg 
cattatgtca gttctgtcag 
acagctgcgg ttttgatgct 
gctcatcaaa cacttcatgc 
cttttttaaa gtcaactttt 
gactgcatag cttctgaatc 
tcatcttact taaacttcat 
aatagagaca gtgcgctcag 
ctgtagtgag gaaagtaaca 
ttgaaggtgg caggcaactt 
ctaacctggc ttctccacta 
ggttgggatg ggtggaagga 
tgagcttggg gtgggcagca 
caaacgtctc atctgcaaat 



ttaatacatt ttcagcagaa gtacttagtt 
ttttagatgt tatacttgaa atactgcata 
agcctttagg agactgttaa gcaatttgct 
atagtagttt accttgtatt gaagaaataa 
tctgtcttca ttttgacttg tctgatatcc 
atattcagac atcaaaactt aacgtgagct 
gttattattt ctgaaactag aaatgatgtt 
agagtgtaag gctagtgaga aatgcataca 
tatcagattt ttttttcatt tggaaatata 
tgaaatgcag tctgattggc atgaagaagc 
tttggaatga aggaagttaa. gcaagggcac 
gagaaagtga acctggattt ctttggctag 
cccgattcct tgaaagggct ccagctttaa 
ggccactggt tatttactgc attatgtctc 
ttgagcatgg actatagcct ggcttcagag 
gtgctgggct gtggctgggg ggactgtggg 
cagggaaaag tgtgggtaac tatttttaag 



aatctctacc 

acttttagct 

gtccaacttt 

agaccatttt 

ttgcagtgcc 

cagtggagtt 

gtcttcatct 

tttattgata 

ttgttttcta 

acagcactct 

aggtccatga 

tgttctaaat 

tgcttccaaa 

agtttcgcag 

gccaggtgaa 

gactccaagc 

tactgtgttg 
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acgtagggtg tgtactctcg aagattaaca 
acagtggaag cattcaaggg tagatcatct 
aagcggtatc agaagagcga ggaaggtaag 
gcagtctggg aaagtagcac cccttgagca 



taggagaact ttcttgctga attctacttg caagagcttt gatgcctggc 
ttctgcagca cctgcaaggc ccagagcctg 
gtccaagctt cagcaggtca ttgtctttgc 
aactgatgtc gaagcctcct gtccactacc 
agagagctaa ctctatgcca tagtctgaag gtaaaatggg ttttaaaaaa gaaaacacaa 

gcagtggtaa acatggtaga aaaggtgcag 



tggtgagctg gagggaaaga 
ttcttccccc agcactgtgc 
tgttgctgca ggcagactgc 



ctatgattgg 
tccacgtaaa 
gagcatgtgc 
ttctggtgcc 
ttctgctcaa 
agcagagtgg 
tctcagaaaa 



OHOKJ 



gtgtgggttc agtaatatat ggatgaattc 3420 
aacgacacca gatcatcaag 
cagtcttcat atgttttccc 
gagacaagga aataattcag 



aggcaaaacc ggctgcccca tgagaagaaa 
aagcccccag gcagtgtgac aggcccctcc 
gcctagggct ctgcccgcga agtgcgtgtt 
ttgagattta gacacaaggg aagcctgaaa 
agcctgtact tcaaatatat attttgtgag 
aagttgcaag agattgaagg ctgagtagtt 



tgccacctag 
tctttggtgg 
ggaggtgttg 
ggagtgtagc 
gagagggtaa 



gaaactactg 
gttacatgtc 
cgcatttgtc 
tcagaagaaa 
tctttccctt 
tcttcctgcc 
tttgtgtctt 
gttggccgca 
tctgaggaac 



cttctaaaca cttgtttgag tggtgagacc 

ttccatccac 



tgatgcactt gcttgtcctt 
acttatccca tatctgtcat 
cagatgtgat aatccccagc 
tttcctgcta agtaaggatt 
ttacattctg ggcattattt 
cctactctta gagtgaatgc 
gttctctgat gaacacacct 
gggcagcgtt tgcctctgaa 



aggcgggaac 
gttttgtttg 
ggcactattt 
gaattggcca 
cacgtttaat 
ttggataggt 
atccatgcat 



atctgacata cctgtctctt 
cgccccaagt ttgagaagat 
ttctcctggc tttgacacct 
caaatatctt tggagtgcgc 
tcttagagtg aaagagaagg 
ctgaataatg gccaaaggtg 
agcaaggagc tctgcggagt 
gcagattccc taggttccct 



ttgcaactga tggtggaact ggtgcttaaa 

ttccttcttg gcagtcagtt tatttctgac agacaaacag ccacccccac 
gaaagtatgt ggctctgcct gggtgtgtta cagctctgcc ctggtgaaag 
cgggcaccat tcatcccaaa caggatcctc attcatggat caagctgtaa 
ctccaacctc aaaacattaa ttggagtacg aatgtaatta aaactgcatt 



aagcttccct 

gcgtttggtt 

tggtttgtaa 

atttaaaata 

gagatcttct 

gagtgctctt 

tccacatcca 

cgtcacttgg 

ggcagttgct 

cacgaaatag 

tgctctcaag 

aagagaagat 

ggtgggtttc 

tgcagttatt 
gctacttctt 
tgcaggctta 
gggattaaaa 
ggaacttggg 
ctcgcattcc 



taagtcattt agtctggact ctgcagcatg taggtcggca gctcccactt tctcaaagac 
cactgatgga ggagtagtaa aaatggagac cgattcagaa caaccaacgg agtgttgccg 
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aagaaactga 
ctacttcaaa 
cgagtaccat 
catagcggct 
ccttggggtt 
tgtcatgtgg 
cagcggtttg 
ccagcatttt 
tgggtgtatt 
actggcaagc 
gccagcaggg 
actctgagag 
tgcgtagagc 
tcctcattct 
atgaatccat 
atggaagctt 
accacagcaa 
ttcttccttg 
agttcagtct 
aggaccaaat 
cactatttca 
acattacata 



tggaaataat 
tgaggtcgga 
ttttctctac 
gaagctcccc 
tctctcacag 
gatccctact 
gaaagagaaa 
ggtttttaat 
accgaggaac 
tgtcaaaaac 
ccagcacgag 
caactgcttt 
gtgtgcttgg 
cctaagcatg 
cactgtagga 
atttattttt 
attaaaggtg 
caaggccaca 
cctgctggga 
agggtctatc 
ctgctcccac 
aatttgacct 
ccccttaagg 
aacatcatcc 



gcatgaattg 
gaaggtcagt 
aagaaaaacg 
cctggctgcc 
cagtaatggg 
gtgccctcct 
aagaatttgg 
tatgtcaata 
aaaggaaggc 
aaaaaggcct 



tgtggtggac atttttttta aatacataaa 
gttttattag cagccataaa accaggtgag 
attctgagct ctgcgtaagt ataagt.tctc 
tgccatctca gctggagtgc agtgccattt 
acaatacttc acaaaaattc tttcttttcc 
ggttttacgt taccccctga ctgttccatt 
aaataaaaca tgtctacgtt atcacctcct 
actggcttag atttggaaat gagagggggt 
ttatataaac tcaagtcttt tatttagaga 
taccaccaaa ttaagtgaat agccgctata 



5280 



i 4 n 
IJ4U 



ggatggtgca ctgctggcac tatgccacgg 
ggaaatgaca gcacttggtg caatttcctt 
cgacagtttt tctagttagg ccacttcttt 



tctccatgct 



ggtaatccca gtcaagtgaa 
ttctcgtggt gatcaaatct ttgtgtgagg 

tatcagtctt ctctatgaca 



cgttcttcca 
aaggaggctg 
ggaaaatgct 
cagctaaccg 
tggggttttt 
ggttacaaac 
ggtaccaata 
cattcagaac 



ctgtgtttaa 
aggggcctta 
ctcaggctgc ccagggcccc atccagcctg 



gtgggatgaa gagggtcttc 
gagagctgta gaatacagcc 
catcttataa ccccttctga 
gttcctgctg ttcctcctgg 

♦ 

caaagataca gcctgaattt 
ttgttctcta tatagttatt 
aactagaatc atagaatggt 



cctgcttgtg 

tgtttcagaa 

tttccttctc 

cgttcaaaca 

tctataaaat 

attcacatcc 

tagctttacg 

tggggtaaga 

gactcatctt 

aaggctatct 

tttctaggcc 

tccttcccca 

ttggattgga 



atttccaacc ctctgccatg ggctgcttgc cacccactgg 



acagcttctc 
ttaacatcta 
ctatctgtcc 
ggctgcagtg 



tgggcagcct gtgccaacac 
atctaaatct cttctctttt 
aagaaatgtg tattggtctc 
aggtctcccc acagccttct 
tcgtaggaga tcatcttagt 



cagcctgtct 
cacggctttc ttgtggagcc ccaggtctgg 



gccttgagca cctccaggga tggggcaccc 
ctcaccactc tctgggtaaa gaattctctt 
agtttaaagc cattcctctt tttcccgttg 
cctcctgctt ataagcagga agtactggaa 
cttctccagg ctgaacaagc ccagctcctt 
ggccctcctc tggacccatt ccaacagttc 
atgcagtact tcagatgggg ccttacaaag 
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gcagagcaga 

cccagggtac 

catccaccag 

tcaggagact 

atatacattt 

tacatgcaga 

caatttgctg 

ttaccttttg 

ctctgctctg 

ttgtcctcca 

cttcagcagc 



tggggacaat 

tgttggcctt 

aacccacg'ct 

tccattcttt 

cagttcatgt 

attcctagtg 

caagtacctt 

gggtaagctt 

ttctgactgc 

tcctttccca 

catttaattc 



cgcttacccc 

tcaggctccc 

tcctggttaa 

aggacagact 

ttcctgtaac 

ccatctcagt 

ccaagctgcg 

ttgtatctgc 

accattttct 

gcttgtatct 

ttcagtgtca 



tccctgctgg ctgcccctgt 
agaccccttg ctgatttgty 
tacttctgcc ctcacttctg 



gtgttacacc 
aggacagaat 
agggttttca 
gcctcccata 
agagaccctg 
agatcaccca 
ttgacaaata 



tacctgccct 
atgtattcct 
tggcagtatt 
aatcctgtat 
ggggttctga 
gttgttcctg 
caggcctatt 



tcttgttctg ttgatgccac 



ttttcagcag tcttgcaaag aacatctagc tgaaaacttt ctgccattca 
cagttcttct tgtttgaggt gagccataaa ttactagaac ttcgtcactg 
gcattttatt acttctatta tgtacttact ttgacataac acagacacgc 
ctgggatttc cacagtgtct ctgtgtcctt cacatggttt tactgtcata 
aaccttggca atctgcccag ctgcccatca caagaaaaga gattcctttt 
tcttcagcca ataaacaaaa tgtgagaagc ccaaacaaga acttgtgggg 
tcaagggaga gacagctgaa gggttgtgta gctcaataga attaagaaat 



tgtcagacag ttttgcctga tttatacagg cacgccccaa 



aggccacctt 
cgtggagaat 
cagcaaagta 
caccaagatg 
ctgcctgctc 



gcagtccttg 
catgatggca 
acacttgctg 
agggatgctc 
attagcatca 



tgaggaaagt tgctcatctt 
gatgcttaaa tgtggtcact 
gatcaggagg gaacacatag 
atgatcatgc atgctactta 



gtttgtaaga 
gttcttgctg 
ctgtaggtgc 
ccagctgacg 
cctcagccct 
cttcacatca 



taagtcatag 
tttactatgg 
tctgctatcc 

gatgctgggg 
caccagccca 
tcaaaccttt 



gacatcttta tttttctatg 
tgggaatgta ccctcagctc 
ggaaggtgtg tgtgtgtgaa 



gccagagagg 
gtaacttttc 
taagatgcta 
agacagcgat 
cagtaacagt 
tcagaaggat 
ggcctgactg 
atttcaagtc 
caaggCcaga 
tgtagaattg 



tttttcttcc tgctgtcagg aacattttga ataccagaga aaaagaaaag 
ggcatgggag gagttgtcac acttgcaaaa taaaggatgc agtcccaaat 
tcagggtctg aaggaggatc agaaactgtg tatacaattt caggcttctc 



tttgatgcag 

1 . *. 

LCaayL. v_ u «- >- 

taagcttgtt 
attcttgcat 
ctaacaaaaa 
agcacatagt 
ttgggatcag 
tgtgcttcag 
tacaacttcc 
tttgtgtttg 
tggaacagga 
atattcttac 
acaagtttat 
acatattttg 
cttccgttat 
ttattacttc 
caggctgcca 
aataaagctg 
ctgtctgcca 
tggtgaattg 
aaataggaga 
ggcactcgca 
gggtcccatg 
catcccaagc 
atgcctcccg 
agaacctccg. 
tcttccttca 
cctttgttat 
tgctcttctt 
gttcataatc 
tgaatgcagc 
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ctagtcagaa 

ttgcccatca 

ggtgctgcac 

cttcttctta 

acaggagaat 

tctttactgg 

tcatttcttc 

aggataaaaa 



ccctcggaaa 
tgaaagtgaa 
actcccacac 



agcttcttaa ttatggtaca 
gaggaactag 
tatttggaaa 
catgttggtg 
tcatagggat 



ttttgaaagc tgttcctggc cgaggcagta 
tgtcttcaag gtgcagcagg aggaaacacc 
cgctgaagga atccagctcc tgtttgagca 
gttcattttt ataggacttc caggaaggat 
tctccagttg gcagatgact atgactactg 
ttctgtttga ccaccatgga gtcacccatt 
gaattgcaaa gcaggagtta gcgaagatct 
tctggctatg aaagtctgct tacaaggaag 
agtttgaaga caatgaggtt ttagctgcat ttgacatgaa 
atagctatgg tatttacgtg tctttttgct tagttactta 
gtatgaactc aggtctctcg ggctactggc atggattgat tacatacaac 
gcagtgattt agggtttatg agtacttttg cagtaaatca 
tcagggaaaa aaaaaaaaag ccaaccctga cagacatccc 
atcacagctc agtgcggtcc cagagaacac agggactctt 

gttagtcaga agactttcca 
ctccgctgca cagttccagt 



caggaacaaa 
taaccactgc 
tgaaacaaca 



ctgggaatat 
taataattct 
acagcacagt 
aataaatcta 



gaaattgaga cctctactgg 
ttgaccccag ctgaggtcaa 



agggcctcaa gataactgat 



aggcaatcct ggaattttct 
tggcactttt tgggtcaggc cgtgatccaa 
tgcctgaccg tcccaactca ctgcactcaa 
ttgaaattgc agtgtggccc agaggggctg 
ttaatcctca gcaagtgcaa tttgcagcca 
atcagtatca acaagtggtt tggcttggaa 
tactctctaa tggagttgca ttttgaagca 
ggctgctaaa cattagggtc aattttccag 
tgcaaagctg cccaaacata gcacttccaa 
tcttgccagc actgtccttc tcaaatgaac 
gtaacaagct ttgaatgtca ttaaaaagta 

* 

gcccactaga aacatcttgt acaagctgaa 
actttataca atcatagaat catagaatgg 
aagatccaac acccccgcca caggcagggc 
gcagcccagg gctccatcca acctggccat 
ctctctgggc agcctgtgcc agcacctcac 



ggagcagaag 
acaaaggcga 
caccagtact 
ttaaattgaa 
gatggagtct 
ggacactgtg 
tgcactttct 
ttgaatacaa 
tcaacaaaca 
tatctgcttt 
cactggggct 
cctgggttgg 
caccaacctc 
gaacacctcc 
caccctctct 



tagggttagt 
agctcaggtg 
ctcttaggac 
ttctggccac 
catcccagtt 
ttccagctat 
aaccacaaga 
ggattgacca 
ctaactgata 
aggggctcta 
aaaagctggc 
gaagtgtctg 
ttatatgcag 
atttcaaagt 
cagtagttca 
ccagattagt 
aagggacccc 
cagatctggt 
agggatggag 
gtgaagaact 



tgtaatttta 
aatgttaatc 
gaaatcaagg 
ctttatgtac 
agttcagctg 
tgtacagttc 
ggtcagggag 
gtggcttttg 
cgaggcaaca 
ctacaatgca 
caggagtagc 
ctcctaaaga 
cagttcccca 
gcgtactgct 
ctagtagaaa 
gcttatttat 
ggtaaaacct 
aaggatcatg 
actagaccag 
catccacaac 
tttccctgac 
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atccaatcta agccttccct ccttgaggtt agatccactc ccccttgtgc tatcactgtc 
tactcttgta aaaagttgat tctcctcctt tttggaaggt tgcaatgagg tctccttgca 
gccttcttct cttctgcagg atgaacaagc ccagctccct cagcctgtct ttataggaga 10980 
ggtgctccag ccctctgatc atctttgtgg ccctcctctg gacccgctcc aagagctcca 11040 • 
catctttcct gtactggggg ccccaggcct gaatgcagta ctccagatgg ggcctcaaaa 11100 
gagcagagta aagagggaca atcaccttcc tcaccctgct ggccagccct cttctgatgg 11160 
agccctggat acaactggct ttctgagctg caacttctco ttatcagttc cactattaaa 11220 
acaggaacaa tacaacaggt gctgatggcc agtgcagagt ttttcacact tottcatttc 11280 
ggtagatctt agatgaggaa cgttgaagtt gtgcttctgc gtgtgcttct tcctcctcaa 11340 
atactcctgc ctgatacctc accccacctg ccactgaatg gctccatggc cccctgcagc 11400 
oagggccctg atgaacccgg cactgcttca gatgctgttt aatagcacag tatgaccaag 11460 
ttgcacctat gaatacacaa acaatgtgtt gcatccttca gcacttgaga agaagagcca 11520 
aatttgcatt gtcaggaaat ggtttagtaa ttctgccaat taaaacttgt ttatctacca 11580 
tggctgtttt tatggctgtt agtagtggta cactgatgat gaacaatggc tatgcagtaa 11640 
aatcaagact gtagatattg caacagacta taaaattcct ctgtggctta gccaatgtgg 11700 
tacttcccac attgtataag aaatttggca agtttagagc aatgtttgaa gtgttgggaa 11760 
atttctgtat actcaagagg gcgtttttga caactgtaga acagaggaat caaaaggggg 11820 
tgggaggaag ttaaaagaag aggcaggtgc aagagagctt gcagtcccgc tgtgtgtacg 11880 
acactggcaa catgaggtct ttgctaatct tggtgctttg cttcctgccc ctggctgcct 11940 
tagggtgcga tctgcotcag acccacagcc tgggcagcag gaggaocctg atgctgctgg 12000 
ctcagatgag gagaatcagc ctgtttagct gcctgaagga taggcacgat tttggctttc 12060 
ctcaagagga gtttggcaac cagtttcaga aggctgagac catccctgtg ctgcacgaga 12120 
tgatccagca gatctttaao ctgtttagca ccaaggatag cagcgctgct tgggatgaga 12180 
ccctgctgga taagttttac accgagctgt acoagcagct gaacgatctg gaggcttgcg 12240 
tgatccaggg cgtgggcgtg accgagaccc ctctgatgaa ggaggatagc atcctggctg 12300 
tgaggaagta ctttcagagg atcaccctgt acctgaagga gaagaagtac agcccctgcg 12360 
cttgggaagt cgtgagggct gagatcatga ggagctttag cctgagcacc aacctgcaag 12420 
agagcttgag gtctaaggag taaaaagtct agagtcgggg cggcgcgtgg taggtggcgg 12480 
ggggttccca ggagagcccc cagcgcggac ggcagcgccg tcactcaccg ctccgtctcc 12540 
ctccgccoag ggtcgcctgg ogcaaccgct gcaagggcac cgacgtccag gcgtggatca 12600 
gaggctgccg gctgtgagga gctgccgcgc ccggcccgcc cgctgcacag ccggccgctt 12660 
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tgcgagcgcg acgctacccg cttggcagtt ttaaacgcat ccctcattaa aacgactata 12720 
cgcaaacgcc ttcccgtcgg tccgcgtctc tttccgccgc cagggcgaca ctcgcgggga 12730 
gggcgggaag ggggccgggc gggagcccgc ggccaaccgt cgccocgtga cggcaccgcc 12840 
ccgcccccgt gacgcggtgc gggcgccggg gccgtggggc tgagcgctgc ggcggggccg 12900 
ggccgggccg gggcgggagc tgagcgcggc gcggctgcgg gcggcgcccc ctccggtgoa 12960 
atatgttcaa gagaatggct gagttcgggc ctgactccgg gggcagggtg aaggtgcggc 13020 
gcgggcggag ggacggggcg ggcgcggggc cgcccggcgg gtgccggggc ctctgccggc 13080 
ccgcocggct cgggctgctg cggcgcttac gggcgcgctt ctcgccgctg ccgcttctct 13140 
tctctcccgc gcaagggcgt caccatcgtg aagccggtag tgtacgggaa cgtggcgcgg 13200 
tacttcggga agaagaggga ggaggacggg cacacgcatc agtggacggt ttacgtgaag 13260 
ccctacagga acgaggtagg gcccgagcgc gtcggccgcc gttctcggag cgccggagcc 
gtcagcgccg cgcctgggtg cgctgtggga cacagcgagc ttctctcgta ggacatgtcc 
gcctacgtga aaaaaatcca gttcaagctg cacgagagct acgggaatcc tctccgaggt 
gggtgttgcg tcggggggtt tgctccgctc ggtcccgctg aggctcgtcg ccctcatctt 
tctttcgtgc cgcagtcgtt accaaaccgc cgtacgagat caccgaaacg ggctggggcg 
aatttgaaat catcatcaag atatttttca ttgatccaaa cgagcgaccc gtaagtacgc 
tcagcttctc gtagtgcttc ccccgtcctg gcggcccggg gctgggctgc tcgctgctgc 
cggtcacagt cccgccagcc gcggagctga ctgagctccc tttcccggga cgtgtgctct 
gtgttcggtc agcgaggcta tcgggagggc tttggctgca tttggcttct ctggcgctta 
gcgcaggagc acgttgtgct acgcctgaac tacagctgtg agaaggccgt ggaaaccgct 
ctcaaactga tttattggcg aaatggctct aaactaaatc gtctcctcto tttggaaatg 
ctttagagaa ggtctctgtg gtagttctta tgcatctatc ctaaagcact tggccagaca 
atttaaagac atcaagcagc atttatagca ggcacgttta ataacgaata ctgaatttaa 14040 
gtaactctgc tcaogttgta tgacgtttat tttcgtattc ctgaaagcca ttaaaatcct 14100 
gtgcagttgt ttagtaagaa cagctgccac tgttttgtat ctaggagata actggtgttt 14160 
coctacagtt ctcaagctga taaaactctg tctttgtatc taggtaaccc tgtatcactt- 14220 
gctgaagctt tttcagtctg acaccaatgc aatcotggga aagaaaactg tagtttctga 14280 
attctatgat gaaatggtat gaaaatttta atgtcaaccg agcctgactt tatttaaaaa 14340 
aaattattga tggtgctgtg tattttggtc cttccttaga tatttoaaga tcctactgcc 14400 
atgatgcagc aactgctaac gacgtcccgt cagctgacac ttggtgctta caagcatgaa 14460 
acagagtgta agtgcaaaat gaggatacct tcgccgaccg tcattcacta ctaatgtttt 
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ctgtgggatg tgatcgtaca gtgagtttgg ctgtgtgaaa tttgaatagc ttggtattgg 14580 
cagtgatgac gtgatcgatg ccttgcttat catgtttgaa atgaagtaga ataaatgcag 14 640 
cctgctttat ttgagatagt ttggttcatt ttatggaatg caagcaaaga ttatacttcc 14700. 
tcactgaatt gcactgtcca aaggtgtgaa atgtgtgggg atctggagga ccgtgaccga 14760 
gggacattgg atcgctatct cccatttctt ttgctgttac cagttcagat tttcttttca 14820 
cctagtcttt aattcccagg gttttgtttt ttccttggtc atagtttttg tttttcactc 
tggcaaatga tgttgtgaat tacactgctt cagccacaaa actgatggac tgaatgaggt 
catcaaacaa acttttcttc ttccgtattt cctttttttt cccccactta tcatttttac 
tgctgttgtt gagtctgtaa ggctaaaagt aactgttttg tgctttttca ggacgtgtgc 
tttccaaatt actgccacat atataaagaa aggttggaat tttaaagata attcatgttt 
cttcttcttt tttgccacca cagttgcaga tcttgaagta aaaaccaggg aaaagctgga 
agctgccaaa aagaaaacoa gttttgaaat tgctgagctt aaagaaaggt taaaagcaag 
tcgtgaaacc atcaactgct taaagagtga aatcagaaaa ctcgaagagg atgatcagtc 
taaagatatg tgatgagtgt tgacttggca gggagcctat aatgagaatg aaaggacttc 
agtcgtggag ttgtatgcgt tctctccaat tctgtaacgg agactgtatg aatttcattt 
gcaaatcact gcagtgtgtg acaactgact ttttataaat ggcagaaaac aagaatgaat 
gtatcctcat tttatagtta aaatctatgg gtatgtactg gtttatttca aggagaatgg 
atcgtagaga cttggaggcc agattgctgc ttgtattgac tgcatttgag tggtgtagga 
acattttgtc tatggtcccg tgttagttta cagaatgcca ctgttcactg ttttgttttg 
tattttactt tttctactgc aacgtcaagg ttttaaaagt tgaaaataaa acatgcaggt 
tttttttaaa tatttttttg tctctatcca gtttgggctt caagtattat tgttaacagc 
aagtcctgat ttaagtcaga ggctgaagtg taatggtatt caagatgctt aagtctgttg 
tcagcaaaac aaaagagaaa acttcataaa atcaggaagt tggcatttct aataacttct 
ttatcaacag ataagagttt ctagccctgc atctactttc acttatgtag ttgatgcctt 15960 
tatattttgt gtgtttggat gcaggaagtg attcctactc tgttatgtag atattctatt 16020 
taacacttgt actctgctgt gcttagcctt tccccatgaa aattcagcgg ctgtaaatcc 16080 
ccctcttctt ttgtagcctc atacagatgg cagaccctca ggcttataaa ggcttgggca 16140 
tcttctttac tgctttgaga ttctgtgttg cagtaacctc tgccagagag gagaaaagcc 16200 
ccacaaacct catccccttc ttctatagca atcagtatta ctaatgcttt gagaacagag 16260 
cactggtttg aaacgtttga taattagcat ttaacatggc ttggtaaaga tgcagaactg 16320 
aaacagctgt gacagtatga actcagtatg gagacttcat taagacaaac agctgttaaa 16380 
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atcaggcatg tttcattgag gaggacgggg caacttgcac cagtggtgcc cacacaaatc 16440 
cttcctggcg ctgcagacca atttttctgg cattctgact gccgttgctg ctggtcacag 
agagcaacta tttttatcag ccacaggcaa tttgcttgta gtattttcca agtgttgtag 
gtaagtataa atgcatcggc tccagagcac tttgagtata cttattaaaa acataaatga 
aagacaaatt agctttgctt gggtgcacag aacattttta gttccagcct gctttttggt 
agaagccctc ttctgaggct agaactgact ttgacaagta gagaaactgg caacggagct 
attgctatcg aaggatcctt gttaacaaag ttaatcgtct tttaaggttt ggtttattca 
ttaaatttgc ttttaagctg tagctgaaaa agaacgtgct gtcttccatg caccaggtgg 
cagctctgtg caaagtgctc tctggtctca ccagcctttt aattgccggg attctggcac 
gtctgagagg gctcagactg gcttcgtttg tttgaacagc gtgtactgct ttctgtagac 
atggccggtt tctctcctgc agcttatgaa actgttcaca ctgaacacac tggaacaggt 
tgcc'caagga ggccgtggat gccccatccc tggaggcatt caaggccagg ctggatgtgg 
ctctgggcag cctggtctgg tggttggcga tcctgcacat agcagcgggg ttgaaactcg 
atgatcactg tggtcctttt caacccaggc tattctatga ttctatgatt caacagcaaa 
tcatatgtac tgagagagga aacaaacaca agtgctactg tttgcaagtt ttgttcattt 
ggtaaaagag tcaggtttta aaattcaaaa tctgtctggt tttggtgttt tttttttttt 17340 
atttattatt tctttggggt tctttttgat gctttatctt tctctgccag gactgtgtga 
caatgggaac gaaaaagaac atgccaggca ctgtcctgga ttgcacacgc tggttgcact 
cagtagcagg ctcagaactg ccagtctttc cacagtatta ctttctaaac ctaattttaa 
tagcgttagt agacttccat cactgggcag tgcttagtga atgctctgtg tgaacgtttt 
acttataagc atgttggaag ttttgatgtt cctggatgca gtagggaagg acagattagc 
tatgtgaaaa gtagattctg agtatcgggg ttacaaaaag tatagaaacg atgagaaatt 17700 
cttgttgtaa ctaattggaa tttctttaag cgttcactta tgctacattc atagtatttc 17760 
catttaaaag taggaaaagg taaaacgtga aatcgtgtga ttttcggatg gaacaccgcc 17820 
ttcctatgca cctgaccaac ttccagagga aaagcctatt gaaagccgag attaagccac 17880 
caaaagaact catttgcatt ggaatatgta gtatttgccc tcttcctccc gggtaattac 17940 
tatactttat agggtgctta tatgttaaat gagtggctgg cactttttat tctcacagct 18000 
gtggggaatt ctgtcctcta ggacagaaac aattttaatc tgttccactg gtgactgctt 18060 
tgtcagcact tccacctgaa gagatcaata cactcttcaa tgtctagttc tgcaacactt 18120 
ggcaaacctc acatcttatt tcatactctc ttcatgccta tgcttattaa agcaataatc 18180 
tgggtaattt ttgttttaat cactgtcctg accccagtga tgaccgtgtc ccacctaaag 18240 
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ctcaattcag gtcctgaatc tcttcaactc tctatagcta acatgaagaa tcttcaaaag 18300 

ttaggtctga gggacttaag gctaactgta gatgttgttg cctggttnct gtgctgaagg 
ccgtgtagta gttagagcat tcaacctcta g 



io-)cn 
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18391 



<210> 11 

<211> 586 

<212> DMA 

<213> Artificial sequence 
<220> 

<223> MDOT artificial promoter 



gtaccgggcc ccccctcgag gtgaatatcc aagaatgcag aactgcatgg aaagcagagc 
tgcaggcacg atggtgctga gccttagctg cttcctgctg ggagatgtgg atgcagagac 
gaatgaagga cctgtccctt actcccctca gcattctgtg ctatttaggg ttctaccaga 
gtccttaaga ggtttttttt ttttttggtc caaaagtctg tttgtttggt tttgaccact 
gagagcatgt gacacttgtc tcaagctatt aaccaagtgt ccagccaaaa tcgatgtcac 
aacttgggaa ttttccattt gaagcccctt gcaaaaacaa agagcacctt gcctgctcca 
gctcctggct gtgaagggtt ttggtgccaa agagtgaaag gcttcctaaa aatgggctga 
gccggggaag gggggcaact tgggggctat tgagaaacaa ggaaggacaa acagcgttag 
gtcattgctt ctgcaaacac agccagggct gctcctctat aaaaggggaa gaaagaggct 
ccgcagccat cacagaccca gaggggacgg tctgtgaatc aagctt 



<210> 12 
<211> 11 
<212> " PRT 

<213> Artificial sequence 
<220> 

<223> SV40 terminator 
<400> 12 

Cys Gly Gly Pro Lys Lys Lys Arg Lys Val Gly 
1 ~ 5 10 



<210> 13 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Lys051 

<400> 13 

tgcatccttc agcacttgag 



60 
120 
180 
240 
300 
360 
420 
480 
540 
586 
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<210> 14 

<212> DNA 

<213> Artificial sequence 
<22Q> 

<223> Primer IFN-3rev 

<400> 14 20 
aactcctctt gaggaaagcc 

<210> 15 

<211> 34 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer LYSBSU 

<400> 15 34 
ccccccccta aggcagccag gggcaggaag caaa 

<210> 16 

<211> 12 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer SaltoNotI 

<400> 16 12 
tcgagcggcc gc 

<210> 17 
<211> 83 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the formation 

of the chicken codon optimized human xnterferon 
2b-encoding nucleic acid 

aJggctttga cotttgcctt actggtggct ctcctggtgc tgagctgcaa gagcagctgc 60 
tctgtgggct gcgatctgcc tea 

<210> i8 
<211> 100 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the formation ^ 
of the chicken codon optimized human interferon 
2b-encoding nucleic acid 

ga^cacagc otgggcagca ggaggaccct gatgetgetg gctcagatga ggagaatcag 60 
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100 



cctgtttagc tgcctgaagg ataggcacga ttttggcttt 

<210> 19 
<211> 62 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 

crcaagagga gtttggcaac cagtttcaga aggctgagac catccctgtg ctgcacgaga 60 

tg 

<210> 20 
<211> 94 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223>primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 

tccagcagat ctttaacctg tttagcacca aggatagcag cgctgcttgg gatgagaccc 60 
tgctggataa gttttacacc gagctgtacc agca 

<210> 21 
<211> 77 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 

cJgaacgatc tggaggcttg cgtgatccag ggcgtgggcg tgaccgagac ccctctgatg 60 
aaggaggata gcatcct 

<210> 22 
<211> 82 
<212> DNA 

<213> Artificial Sequence 
<220> 

" <223> primer used in" the " formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 



<400> 22 



60 



gctgtgagga agtactttca gaggatcacc ctgtacctga aggagaagaa gtacagccct ^ 
tgcgcttggg aagtcgtgag gg 



<210> 23 
<211> 65 
<212> DNA 
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<213> Artificial Sequence 



<220> 

<223> primer used in the formation 

■ of the chicken codon optimized human interferon 
2b-encoding nucleic acid 

ctgagatcat gaggagcttt agcctgagca ccaacctgca agagagcttg aggtctaagg 60 
agtaa 



<210> 24 

<211> 34 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 

<400> 24 

cccaagcttt caccatggct ttgacctttg cctt 

<210> 25 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 



<400> 25 

atctgcctca gacccacag 

<210> 26 
<211> 26 
<212> DNA 

~<213> Artificial Sequence 
<220> 

<223> primer used in the formation ^ 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 

<400> 26 

gattttggct ttcctcaaga ggagtt 

<210> 27 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 

<400> 27 
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gcacgagatg atccagcaga t 

<210> 28 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 

<400> 28 

atcgttcagc tgctggtaca 

<210> 29 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 

<400> 29 

cctcacagcc aggatgctat 

<210> 30 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 

<400> 30 

atgatctcag ccctcacgac 

<210> 31 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 

<4.00> 31 

ctgtgggtct gaggcagat 

<210> 32 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the formation 

of the chicken codon optimized human interferon 
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2b-encoding nucleic acid 
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aactcctctt gaggaaagcc aaaatc 

<210> 33 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 



<400> 33 

atctgctgga tcatctcgtg c 21 

<210> 34 
<211> 36 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 

<400> 34 

tgctctagac tttttactcc ttagacctca agctct 



<210> 35 

<211> 19 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> neo for-1 primer for detecting the interferon transgene 

* ■ • ■ • ■ 

<400> 35 

tggattgcac gcaggttct 

<210> 36 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> neo rev-1 primer for detecting the interferon transgene 



<400> 36 

gtgcccagtc atagccgaat 20 

<210> 37 
<211> 20 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> FAM labeled NEO-PROBE1 for detecting the interferon transgene 
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<400> 37 

cctctccacc caagcggccg 

<210> 38 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the synthesis of 
<400> 38 

tcactcgagg tgaatatcca agaat 

<210> 39 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the synthesis of 
<400> 39 

gagatcgatt ttggctggac acttg 

<210> 40 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the synthesis of 

<400> 40 

cacatcgatg tcacaacttg ggaat 

<210> 41 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the synthesis of 
<400> 41 

tctaagcttc gtcacagacc gtccc 
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IT) (54) Title: PRODUCTION OF A TRANSGENIC AVIAN BY CYTOPLASMIC INJECTION 

o , 

^ (57) Abstract: This invention provides methods for the stable introduction of heterologous coding sequences into the genome of a 
bird and expressing the coding sequences to produce desired proteins or to alter the phenotype of the bird. The present invention 
provides preferred methods for introducing a transgene into the cytoplasm of avian embryonic cells by cytoplasmic microinjection. 
O The embryo then develops into a transgenic adult capable of expressing a heterologous protein and/or capable of generating a line 
^ of transgenic birds through breeding. Synthetic vectors and gene promoters useful in the methods are also provided by the present 
^ invention, as are transgenic birds that express heterologous protein and avian eggs containing heterologous protein. 
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